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ENZYMATIC PROCESSES FOR THE PRODUCTION OF 
4-SUBSTITUtED 3-HYDROXYBUTYRIC ACID DERIVATIVES 
AND VICINAL CYANO, HYDROXY SUBSTITUTED CARBOXYLIC ACID ESTERS 

CROSS-REFERENCE TO RELATED APPLICATION 
This application is a continuation in part of U.S.S.N. 10/639,159, filed August 11, 
2003, which claims the benefit under 35 U.S.C. § 1 19(e) of U.S.S.N. 60/402,436, filed 
August 9, 2002, both of which are incorporated herein by reference in their entireties. 



COPYRIGHT NOTIFICATION 
A portion- of the disclosure of this patent document contains material which is subject 
to copyright protection. The copyright owner has no objection to the facsimile' reproduction 
by anyone of the patent document or patent disclosure, as it appears in the Patent and 
15 Trademark Office patent file or records, but otherwise reserves all copyright rights 
whatsoever. 

FIELD OF THE INVENTION 
The present invention relates to novel enzymatic methods and compositions for 
20 preparing 4-substituted 3-hydroxybutyric acid derivatives and vicinal cyano, hydroxy 
substituted carbdxylic acid esters, 

BACKGROUND 

4-substituted 3-hydroxybutyric acid derivatives and vicinal cyano, hydroxy 
• 25 substituted carboxylic acid esters are commercially important intermediates in the synthesis 
of pharmaceuticals. Nonracemic chiral 4-substituted 3-hydroxybutyric acid esters may be 
utilized in the synthesis of HMG-CoA reductase inhibitors, such as atorvastatin, fluvastatin, 
rosuvastatin, and itavastatin. For example, an ester of (R)-4-cyano-3-hydroxybutyric acid 
and an ester of (3R,5R)-6-cyano-3,5-dihydroxyhexanoic acid are key intermediates for the . 
30 production of "the cholesterol lowering agent atorvastatin. Methods have been described for 
producing certain 4-substituted 3-hydroxybutyric acid esters. Isbell, et al., Carbohydrate 
Res,, 72:301 (1979), report a method for synthesizing an (R)-4-cyano-3-hydroxybutyric acid 
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ester by reacting the monohydrate calcium salt of threonine with hydrogen bromide to 
produce a dibromo derivative of threonine, which is then converted to a vicinal bromohydrin. 
The hydroxyl group of the bromohydrin is protected prior to reaction with sodium cyanide. 
Id 

5 ActaChem. Scand., B37, 341 (1 983) reports a method for producing a 4-cyano- 

3- hydroxybutyrate from a 4-bromo-3-hydroxybutyrate that requires protecting the hydroxy 
group with a protecting group prior to reaction with sodium cyanide. Recent routes to 
synthesize 4-cyano-3-hydroxybutyrate esters involve the uncatalyzed chemical reaction of a 

4- bromo- or 4-chloro- 3-hydroxybutyrate ester, without protection of the hydroxyl group, 

1 0 with a cyanide salt. By-products, however, are formed under the basic conditions created by 
the basic cyanide anion, which are particularly problematic to remove from the product. 
4-Cyano-3-hydroxybutyrate esters are high boiling liquids and vacuum fractional distillation 
is required to separate the 4-cyaho-3-hydroxybutyrate ester from these by-products. The 
distillation conditions are prone to generate additional by-products and the distillation is 

15 troublesome to operate successfully. 

The use of a 4-chloro-3-hydroxybutyric acid ester as a starting material in the 
synthesis of a 4-cyano-3-hydroxybutyric acid ester is more economically attractive than the 
use of a 4-bromo-3-hydroxybutyric acid ester, but requires more forcing conditions in its 
reaction with cyanide salts due to the lower reactivity of the chloro substituent compared to 

20 the bromo substituent. While the cyanation of 4-chIoro-3-hydroxybutyrate esters proceeds 
with alkali cyanide and high temperature, these forcing conditions lead to substantial by- 
product formation, requiring extensive isolation and purification procedures that result in 
additional yield loss. U.S. Pat. No. 5,908,953 discloses that, besides unreacted starting 
material, crude lower alkyl esters of (R)-4-cyano-3-hydroxybutyric acid may contain 

25 hydroxyacrylate, cyanoacrylate, 3-cyanobutyrolactone, 3-hydroxybutyrolactone, y- 

crotonolactone, 3-cyano-4-hydroxybutyrate lower alkyl ester, 3,4-dicyanobutyrate lower 
alkyl ester and high-boiling uncharacterized compounds. U.S. Pat. No. 5,908,953 further 
describes a purification method for lower alkyl esters of (R)-4-cyano-3-hydroxybutyric acid 
that involves distillation of a crude mixture in the presence of a solvent that has a boiling 

JO point of 50°C to 160°C at 10 Torr. Using such distillation methods, the decomposition of 
unreacted starting material is said to be minimized, which otherwise can result in a dramatic 
overall loss in (R)-4-cyano-3-hydroxybutyric acid lower alkyl ester production. U.S. Pat. No. 
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6,140,527 describes an alternative approach for treating crude lower alkyl esters of (R)- 
4-cyano-3-hydroxybutyric acid that involves removal of the dehydrated by-products, such as 
4-hydroxycrotonic acid esters, by chemical reaction, which renders these components water 
soluble and extractable. Thus, although these methods utilize a readily available starting 
5 material, significant yield loss and product purification requirements make them 

commercially undesirable. Accordingly, more efficient methods for producing nonracemic 
chiral 4-substituted 3-hydroxybutyric acid esters under milder conditions would be highly 
desirable. 

Halohydrin dehalogenases, also referred to as haloalcohol dehalogenases or 

10 halohydrin hydrogen-halide lyases, catalyze the elimination of hydrogen halide, as proton and 
halide ion, from vicinal halohydrins to produce the corresponding epoxide. These enzymes 
also catalyze the reverse reaction. Nagasawa et al., Appl. Microbiol. Biotechnol . vol; 36 
(1992) pp. 478-482, disclose activity of a certain halohydrin hydrogen-halide lyase on 
4-chloro-3-hydroxybutyronitrile among other vicinal halohydrins. Nakamura et al., Biochem. 

15 Biophvs. Research Comm . vol. 180 (1991) pp. 124-130 and Tetrahedron vol. 50 (1994) 
pp 1 1821-1 1826, disclose activity of a halohydrin hydrogen-halide lyase to catalyze the 
reaction of certain epoxides with cyanide to form the corresponding. beta-hydroxynitriles. In 
these references and U.S. Patent 5,2 1 0,03 1 , Nakamura et al. disclose a reaction of 
epihalohydrin with alkali cyanide in the presence of a certain halohydrin hydrogen-halide 

20 lyase to produce the corresponding 4-halo-3-hydroxy-butyronitrile. In U.S. Patent No. ■ 

5,166,061, Nakamura et al. disclose a reaction of a l,3-dihalo-2-propanol with alkali cyanide 
in the presence of certain dehalogenating enzymes to produce the corresponding 4-halo- 
3-hydroxybutyronitrile. In Tetrahedron vol. 50 (1 994) pp 1 1 S2 1 - 1 1 S26, Nakamura et al. 
disclose the reaction of l,3-dichloro-2-propanol with cyanide using a purified halohydrin 

25 hydrogen-halide lyase to produce 4-chloro-3-hydroxybutyronitrile. 

Lutje-Spelberg et al., Org. Lett., vol. 2 (2001) pp 41-43, discloses activity of a 
halohydrin dehalogenase to catalyze the reaction of certain styrene oxides with azide to form 
the corresponding l-phenyl-2-azido-ethanol. 
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SUMMARY OF THE INVENTTTYM 
In one aspect, the present invention is directed to a method for producing a 4-cyano-3- 
hydroxybutyric acid ester or amide from a 4-halo-3-hydroxybutyric acid ester or amide, the 
method comprising: 
5 (a) . providing a 4-halo-3-hydroxybutyric acid ester or amide, 

wherein the halo substituent is selected from the group consisting of chlorine, 
bromine, and iodine; and 

(b) contacting the 4-halo-3-hydroxybutyric acid ester or amide with a halohydrin 
dehalogenase and cyanide under conditions sufficient to form a reaction mixture for 
1 0 converting the 4-halo-3-hydroxybutyric acid ester or amide to a 4-cyano-3-hydroxybutyric 
acid ester or amide. 

In a further aspect of the present invention, the 4-halo-3-hydroxybutyric acid ester or 
amide in step (a) is provided by a method comprising: 

providing a 4-halo-3-ketobutyric acid ester or amide, 
1 5 wherein the halo substituent is selected from the group consisting of chlorine, 

bromine, and iodine; and 
. contacting the 4-halo-3-ketobutyric acid ester or amide with a ketoreductase, a 
cofactor, and a cofactor regeneration system under conditions sufficient to form a reaction 
mixture for converting the 4-halo-3-ketobutyric acid ester or amide to the 4-halo-3- 
20 hydroxybutyric acid ester or amide. 

In another aspect, the present invention is directed to a method for producing a 4- 
cyano-3-hydroxybutyric acid ester from a 4-halo-3-ketobutyric acid ester, the method 
comprising: 

(a) providing a 4-halo-3-ketobutyric acid ester, 

25 wherein the halo substituent is selected from the group consisting of chlorine, 

bromine, and iodine; and 

(b) contacting the 4-halo-3-ketobutyric acid ester with a ketoreductase, a cofactor, a 
cofactor regeneration system, cyanide, and a halohydrin dehalogenase to form a reaction 
mixture for converting the 4-halo-3-ketobutyric acid ester to a 4-cyano-3-hydroxybutyric acid 

30 ester. 
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In another embodiment, the present invention is directed to a method for producing a 
4-nucleophile substituted-3-hydroxybutyric acid ester or amide from a 4-h'alo-3- 
hydroxybutyric acid ester or amide, the method comprising: 

(a) providing a 4-halo-3-hydroxybutyric acid ester or amide, 

wherein the halo substituent is selected from the group consisting of chlorine, 
bromine, and iodine; and 

(b) contacting die 4-halo-3-hydroxybutyric acid ester or amide with a halohydrin 
dehalogenase and a nucleophile under conditions suitable to form a reaction mixture for 
converting the 4-halo-3-hydxoxybutyric acid ester or amide to a 4-nucleophile substituted-3- 
hydroxybutyric acid or amide. 

• In a further embodiment, the present invention is directed to a method for producing a 
4-nucleophile substituted-3-hydroxybutyric acid esters or amide, the method comprising: 

(a) providing a 4-halo-3-ketobutyric acid ester or amide 

wherein the halo substituent is selected from the group consisting of chlorine, 
bromine, and iodine; and 

(b) contacting the 4-halo-3-ketobutyric acid ester or amide with a ketoreductase, a 
cofactor, a cofactor regeneration system, a nucleophile, and a halohydrin dehalogenase to 
form a reaction mixture for converting the 4-halo-3-ketobutyric acid ester or amide to a 
4-nucleophile substituted-3-hydroxybutyric acid ester or amide. 

. In another aspect, the present invention is directed to a composition comprising: 

(a) a halohydrin dehalogenase; 

(b) a nucleophile; and 

(c) a 4-halo-3-hydroxybutyric acid ester or amide. 

The present invention further provides compositions that are useful in the production 
of vicinal cyano, hydroxy substituted carboxylic acid esters , as well as additional related 
methods for converting vicinal halo, hydroxy substituted carboxylic acid esters to vicinal 
cyano, hydroxy substituted carboxylic acid esters. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 depicts the amounts of ethyl 4-chJoro-3-hydroxybutyrate (chlorohydrin) and 
ethyl 4-cyano-3-hydroxybutyrate (cyanohydrin) analyzed in test reactions of ethyl 4-chloro- 
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3-hydroxybutyrate with cyanide in aqueous solutions at various pHs in the presence or 
absence of a halohydrin dehalogenase (HHDH), as described in Example 21. 

Figure 2 depicts a 3944 bp expression vector (pCKl 1 0700) of the present invention 
comprising a p 1 5 A origin of replication (p 1 5 ori), a lad repressor, a T5 promoter, a T7 
ribosomal binding site (T7gl0), and a chloramphenicol resistance gene (camR). 

Figure 3 depicts a 4036 bp expression vector (pCKl 10900) of the present invention 
comprising a pl5A origin of replication (pi 5 ori), a lad repressor, a CAP binding site, a lac 
promoter (lac), a T7 ribosomal binding site (T7gl0 BBS), and a chloramphenicol resistance 
gene (camR). 

Figure 4 depicts the percent conversion vs. time for the reactions of ethyl 
(S)-4-chloro-3-hydroxybutyrate with aqueous hydrocyanic acid in the presence of various 
halohydrin dehalogenase enzymes that are described in Examples 25 through 29. 

DETAILED DESCRIPTION OF THF. TNVFNTTON 
The present invention provides enzymatic methods and compositions for producing 
various 4-substituted 3-hydroxybutyric acid esters and amides from corresponding 4-halo-3- 
hydroxybutyric acid ester and amide substrates. The present invention also provides methods 
and compositions for producing vicinal cyano, hydroxy substituted carboxylic acid esters 
from vicinal halo, hydroxy substituted carboxylic acids. 

• L HALOHYDRIN DEHALOGENAS E-CAT ALYZED CONVERSION QF 4-HAT.O- 
3 -H YDROXYBUT YRIC ACID DERIVATIVES 

The present invention provides a method for producing a 4-nucleophile substituted- 
3-hydroxybutyric acid ester or amide from a 4-halo-3-hydroxybutyric acid ester or amide, the 
method comprising: 

(a) providing a 4-halo-3-hydroxybutyric acid ester or amide, 

wherein the halo substituent is selected from the group consisting of 
chlorine, bromine, and iodine; and 

(b) contacting the 4-halo-3-hydroxybutyric acid ester or amide with a halohydrin 
dehalogenase and a nucleophile under conditions suitable to form a reaction mixture for 
converting the 4-halo-3-hydroxybutyric acid ester or amide to a 4-nucleophile substituted-3- 
hydroxybutyric acid ester or amide. Significantly, the invention method provides a process 
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for the manufacture of 4-substituted 3-hydroxybutyric acid esters and amides in which by- 
product formation is minimized. 

Nucleophiles suitable for use in the practice of the present invention are those that are 
capable of displacing the halo substituent of the 4-halo-34iydroxybutyric acid ester or amide 
5 substrate. Typical nucleophiles utilized in the present invention are anionic nucleophiles. 
Exemplary nucleophiles include cyanide (CN~), azide (N 3 ~), and nitrite (ONCT). 

In a specific embodiment, the present invention provides a method for producing 
4-cyano-3-hydroxybutyric acid esters or amides from 4-halo-3-hydroxybutyric acid esters or 
amides via a halohydrin dehalogenase-catalyzed reaction, the method comprising: 
1 0 (a) providing a 4-halo-3-hydroxybutyric acid ester or amide; • 

wherein the halo substituent is selected from the group consisting of 
chlorine, bromine, and iodine; and 
(b) contacting the 4-halo-3-hydroxybutyric acid ester or amide with a halohydrin . 
dehalogenase and cyanide under conditions suitable to form a reaction mixture for converting 
15 the 4-halo-3-hydroxybutyric acid ester or amide to a 4-cyano-3.-hydroxybutyric acid ester or 
amide. 

As used herein, the term "cyanide" refers to cyanide anion (CIST), hydrocyanic acid 
(HCN), and mixtures thereof. Cyanide may be provided in the form of a cyanide salt, 
typically an alkali salt (for example, NaCN, KCN, and the like), in the form of hydrocyanic 

20 acid (gaseous or in solution), or mixtures thereof. 

4-halo-3-hydroxybutyric- acid esters and amides employed in the practice of the 
present invention may be prepared according to the methods described herein, or 
alternatively, using methods that are well known to those having ordinary skill in the art. 
Such methods are described, for example, in U.S. Patent No. 5,891,685; Hallinan, et al, 

25 Biocatalvsis and.Biotransformation. 12:179-191 (1995); Russ. Chem. Rev.. 41:740 (1972); 
Kataoka, et al., Apul. Microbiol. BiotechnoL 48:699-703 (1997); and U.S. Patent No. 
5,430,171.. 

Suitable 4-halo-3-hydroxybutyric acid ester and amide substrates employed in the 
practice of the present invention include those having the structure IA and IB, respectively: 

30 
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wherein: 



X is a halogen selected from the group consisting of chlorine, bromine, and iodine; 

R 1 , R 2 , R 3 , R 4 , and R 6 are each independently selected from the group consisting of 
hydrogen, fluorine, an optionally substituted lower alkyl, an optionally substituted cycloalkyl, 
an optionally substituted lower alkenyl, an optionally substituted aryl, an optionally 
substituted arylalkyl, amino, an optionally substituted lower alkylamino, an optionally 
substituted cycloalkylamino, an optionally substituted lower alkoxy, an optionally substituted 
cycloalkoxy, an optionally substituted aryloxy, and an optionally substituted arylalkoxy; and 

R s is selected from the group consisting of an optionally substituted lower alkyl, an ' 
optionally substituted cycloalkyl, an optionally substituted aryl, and an optionally substituted 
arylalkyl; and 

R 7 and R 8 are each independently selected from the group consisting of hydrogen, an 
optionally substituted lower alkyl, an optionally substituted cycloalkyl, an optionally 
substituted aryl, and an optionally substituted arylalkyl. 

"Optionally substituted" refers herein to the replacement of hydrogen with a 
monovalent radical. Suitable substitution groups include, for example, hydroxyl, alkyl, a 
lower alkyl, an alkoxy, a lower alkoxy, an alkenyl, a lower alkenyl, nitro, amino, cyano, 
halogen (i.e., halo), thio, and the like. Other suitable substitution groups include carboxy (i.e. 
a carboxylate or carboxylic acid group), carboalkoxy (i.e. an ester group), carbamide (i.e.an 
amide group), and acyl (i.e. forming a ketone), 

The term "lower alkyl" is used herein to refer to branched or straight chain alkyl 
groups having from one to about six carbon atoms that are unsubstituted or substituted, e.g., 
with one or more halo, hydroxyl or other groups, including, e.g., methyl, ethyl, propyl, 
isopropyl, n-butyl, i-butyl, f-butyl, trifluoromethyl, and the like. The term "cycloalkyl" refers 
to carbocyhc alkyl moieties having from 3 to about 6 carbon atoms, as well as heterocyclic 
alkyl moieties having from 3 to about 6 atoms, where at least one ring atom is a heteroatom, 
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and the other atoms are carbon atoms. "Heteroatom" refers herein to oxygen, nitrogen, or 
sulfur. 

The term "lower alkenyl" is used herein to refer to a branched or straight chain group 
having one or more double bonds and from 2 to about 6 carbon atoms. Lower alkenyl groups 
5 employed hi the practice of the present invention may be optionally substituted with the 
groups described herein, including, for example, halo, hydroxyl, lower alkyl, and the like. 

As used herein, the term "lower alkoxy" refers to -OR where R is a lower alkyl or a 
lower alkenyl. Suitable lower alkoxy groups employed in the practice of the present 
invention include methoxy, ethoxy, f-butoxy, trifluoromethoxy, and the like. The term ' 

1 0 "aryloxy" refers herein to RO-, where R is an aryl. As used herein, the term "aryl" refers to 
monocyclic and polycyclic aromatic groups having from 3 to about 14 backbone carbon or 
heteroatoms, and includes both carbocyclic aryl groups- and heterocyclic aryl groups. 
.Carbocyclic aryl groups are aryl groups in which all ring atoms in die aromatic ring are 
carbon. Heterocyclic aryl groups are aryl groups that have from 1 to about 4 heteroatoms as 

1 5 ring atoms in an aromatic ring with the remainder of the ring atoms being carbon atoms. 
Exemplary aryl groups employed as substituents in the present invention include, for 
example, phenyl, pyridyl, pyrimidinyl, naphthyl, and the like. 

The term "arylalkyl" refers herein to an alkyl group substituted with an aryl group. 
Exemplary arylalkyl groups. include benzyl, picolyl, and the like. Substituted arylalkyl 

20 groups may be substituted in either or both aryl and alkyl portions of the arylalkyl group. As 
used herein, the term "arylalkoxy" refers to RO- where R is an arylalkyl. 

The term "cycloalkoxy" refers herein to RO-, where R is an optionally substituted 
C 3 -Cs cycloalkyl. The term "amino" is used herein to refer to the group -NH 2 .. The term 
"lower alkylamino" refers herein to the group -NRR' where R is hydrogen or a lower alkyl, 

25 and R' is a lower alkyl. The term "cycloalkylamino" refers herein to the group -NR where R 
is an optionally substituted divalent aliphatic radical having from 3 to about 8 carbon atoms, 
so that N and R form a cyclic structure, for example, pyrollidino, piperidino, and the like. . 

Specific 4-halo-3-hydroxybutyric acid esters of compound IA that may be employed 
in the practice of the present invention include ethyl 4-chloro-3-hydroxybutyric acid ester 

30 (i.e., where X is chlorine, R 1 , R% R 3 , R 4 , and R 6 are hydrogen, and R 5 is ethyl), methyl 
4-chloro-3-hydroxybutyric acid ester (i.e., where X is chlorine, R 1 , R 2 , R 3 . R 4 and R 6 are 
hydrogen and R 5 is methyl), ethyl 4-bromo-3-hydroxybutyric acid ester (i.e., where X is 
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bromine, R 1 , R 2 , R 3 , R 4 , and R 6 are hydrogen, and R s is ethyl), methyl 4-bromo-3- 
hydroxybutyric acid ester (i.e., where X is bromine, R 1 , R 2 , R 3 , R 4 , and R 6 are hydrogen, and 
R 5 is methyl), t-butyl-4-chloro-3-hydroxybutyric acid ester (i.e., where X is chlorine, R 1 , R 2 , 
R 3 , R 4 , and R 6 are hydrogen, and R 5 is t-butyl), t-butyI-4-bromo-3-hydroxybutyric acid ester 
(i.e., where X is bromine, R 1 , R 2 , R 3 , R 4 , and R 6 are hydrogen, and R 5 is t-butyl), and t-butyl- 
4-iodo-3-hydroxybutyric acid ester (i.e., where X is iodine, R 1 , R 2 , R 3 , R 4 , and R 6 are 
hydrogen, and R s is t-butyl). In certain embodiments, at least one of R 1 , R 2 , R 3 , R 4 , and R 6 is 
a lower alkyl, such as, for example, methyl, ethyl, or propyl. 

Suitable 4-halo-3-hydroxybutyric acid amides of compound IB that maybe employed 
in the practice of the present invention include 4-chloro-3-hydroxybutyric amide (i.e., where 
X is chlorine, R 1 , R 2 , R 3 , R 4 , R 6 , R 7 , and R s are hydrogen), 4-bromo-3-hydroxybutyric amide 
(i.e., where X is bromine, and R 1 , R 2 , R 3 , R 4 , R 5 , R 7 , and R s are hydrogen), and 4-iodo-3- 
hydroxybutyric amide (i.e., where X is iodine, R 1 , R 2 , R 3 , R 4 , R°, R 7 , and R s are hydrogen, 
In certain embodiments, at least one of R 1 , R 2 , R 3 , R 4 , and R 6 is a lower alkyl, such as, for 
example, methyl, ethyl, or propyl. 

The 4-halo substituent of the 4-halo-3-hydroxybutyric acid ester and amide substrates 
is preferably selected from chlorine and bromine. Particularly preferred are 4-chloro- 
3-hydroxybutyric acid ester and amide substrates. 

4-substituted-3-hydroxybutyric acid esters and amides produced by the methods of the 
present invention include those having the structure IIA and IIB, respectively: 




IIA IIB 

where: 

R 1 , R 2 , R 3 , R 4 , R s , R 6 , R 7 , and R s are as defined for structures IA and IB; and 
Nu is selected from the group consisting of -CN, -N 3 , and -ONO. 
When 4-halo-3-hydroxybutyric acid ester substrates having the structure of 
compound IA are reacted with cyanide and halohydrin dehalogenase, 4-cyano- 
3-hydroxybutyric acid ester products are generated that have the structure of compound AL- 
IO 
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5 where R 1 , R 2 , R 3 , R 4 , R 5 , and R 6 are as defined for structure IA . 

Halohydrin dehalogenases are employed in the practice of the present invention to 
catalyze the conversion of a 4-halo-3-hydroxybutyric acid ester or amide to the • 
corresponding 4-nucleophile substituted-3-hydroxybutyric acid ester or amide in the presence 
of a nucleophile. The terms "halohydrin dehalogenase" and "HHDH" are used 

1 0 interchangeably herein to refer to an enzyme that, in the process of the present invention, 
catalyzes the conversion of a vicinal halo, hydroxy substituted carboxylic ester or amide to a 
vicinal cyano, hydroxyl substituted carboxylic ester or amide, such as, for example, 4-halo- 
3-hydroxybutyric acid ester and/or amide to a 4-nucleophile substituted-3-hydroxybutyric 
acid ester and/or amide, respectively, in the presence of a nucleophile such as cyanide. 

1 5 Suitable halohydrin dehalogenases include naturally occurring (wild type) halohydrin 

dehalogenases, as well as non-naturally occurring halohydrin dehalogenases generated by 
human manipulation. Exemplary naturally occurring and non-naturally occurring halohydrin 
dehalogenases and halohydrin dehalogenase-encoding polynucleotides include those 
described herein. 

20 Naturally occurring halohydrin dehalogenase encoding genes have been identified in 

Agrobacterium radiobacter AD1 (hheC), Agrobacterium tumefaciens * (tialB), 
Corynebacterium sp. {hheA encoding la and hhB encoding Vo),Arthrobacter sp. (JiheAmi), 
and Mycobacterium sp. GP1 (hheB G pi). See van Hylckama Vlieg, J.E.T., L. Tang, J.H. Lutje 
Spelberg, T. Smilda, GJ. Poelarends, T. Bosma, A.EJ. van Merode, M.W. Fraaije & Dick B. 

25 Janssen, "Halohydrin Dehalogenases are structurally and mechanistically related to short- 
chain dehydrogenases/reductases (2001) Journal of Bacteriology. lS3:5058-5066 (provides 
the amino acid sequences for these halohydrin dehalogenases in an alignment). 

These naturally occurring halohydrin dehalogenases have been characterized to some 
extent. HHDH from Agrobacterium radiobacter AD 1 is a homotetramer of 28 kD subunits. 
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Corynebacterium sp. N-1074 produces two HHDH enzymes, one of which is composed of 28 
kD subunits (la), while the other is composed of related subunits of 35 and/or 32 kD (Kb). 
HHDH from some sources is easily inactivated under oxidizing conditions in a process that 
leads to dissociation of the subunits, has a broad pH optimum from pH 8 to 9 and an optimal 
temperature of 50°C (Tang, Enz. Microbiol. Techno] (2002) 30:251-258; Swanson, Curr. 
Opinion Biotechnol. (1999) 10:365-369). The optimal P H for HHDH catalyzed epoxide 
formation is 8.0 to 9.0 and the optimal temperature ranges from 45 to 55°C (Van Hylckama 
Vlieg, et al., J- Bacteriol. (2001) 183:5058-5066; Nakamura, et al., Appl. Environ. Mirmhini 
(1994) 60:1297-1301; Nagasawa, et al., Appl. Microbiol. Biotechnol (1992) 36:478-482). 
The optimal pH for the reverse reaction, zing opening by chloride has been reported for the 
two Comebacterium sp. N-1074 enzymes and is 7.4 (la) or 5 (lb). Polynucleotides encoding 
the halohydrin dehalogenase from Agrobacterium radiobacter AD1 are provided herein as 
SEQ ID NOS: 13, 15, and 17. The polynucleotides corresponding to SEQ ID NOS: 13, 15, 
and 17 are variants that encode the same amino acid sequence (the translated sequences are 
provided as SEQ ID NOS: 14, 16, and 18). 

Non-naturally occurring halohydrin dehalogenases can be generated using known 
methods, including, for example, mutagenesis, directed evolution, and the like. Several 
illustrative methods are described hereinbelow. The enzymes can be readily screened for 
activity using the method described in Example 4. Such screening methods may also be 
. readily applied to identifying other naturally occurring halohydrin dehalogenases. Suitable 
non-naturally occurring halohydrin dehalogenases include those corresponding to SEQ ID 
NOS: 24 (HHDH B-03), 26 (HHDH C-04), 28 (HHDH E-01), 30 ( S01056858), 32 (HHDH 
2G5), 34 (HHDH Mzl.lA5), 36 (HHDH cysl.10), 38 (HHDH cys2.12), 74 (HHDH B-12), 
76 (HHDH Mzl/4H6), 78 (HHDH F-04), SO (HHDH A-0S), 82 (HHDH G9), 84 (HHDH F9), 
86 (HHDH H10), 8S (HHDH Al), 90 (HHDH A-03), 92 (HHDH E-03), 94 (HHDH 
S00827801), 96 (HHDH S00890554), 98 (HHDH S00994580), 100 (HHDH S01018044), 
102 (HHDH S01035939), 104 (HHDH S01009684), 106 (HHDH S00817219), 108 (HHDH 
S00708827), 110 (HHDH S00772501), 112 (HHDH S01035968), and 1 14 (HHDH 
SO 1 040430). Exemplary polynucleotide sequences that encode these halohydrin 
dehalogenases include those corresponding to SEQ ID NOS: 23, 25, 27, 29,31 33 35 37 
73, 75, 77, 79, SI, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, and 113, 
respectively. Additional non-naturallv occurring halohydrin dehalogenases that are suitable 
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for use in the practice of the present invention are provided in the patent application entitled, 
"Improved Halohydrin Dehalogenases and Related Polynucleotides," corresponding to 
Attorney Docket No. 0353.1 10US, filed on August 1 1, 2003, and assigned U.S. application 
serial number 60/494,3S2, and in the patent application entitled, "Improved Halohydrin 
5 Dehalogenases and Related Polynucleotides," corresponding. to Attorney Docket No. 
0353.2 10US, filed on February IS, 2004, and assigned U.S. application serial number 

, both of which are incorporated herein by reference in their ■ 

entireties. 

Halohydrin dehalogenases that are suitable for use in the practice of the present 

1 0 invention, whether naturally occurring or non-naturally occurring, can be readily identified 
by those having ordinary skill in the art using the method described in Example 4 and the 
vicinal halo, hydroxyl substituted substrate and nucleophile of interest.. Halohydrin 
dehalogenases employed in.the practice of the present invention typically exhibit an activity 
of at least about 1 jimol/min/mg in the assay described in Example 4. Halohydrin 

1 5 dehalogenases employed in the practice of the present invention may exhibit an activity of at 
least about 10 jimol/min/mg, and sometimes at least about 10 2 (imol/min/mg, and up to about 
10 3 jjmol/min/mg or higher, in the assay described in Example 4. 

Halohydrin dehalogenase may be provided to the reaction mixture in the forni of 
purified enzyme, cell extract, cell lysate, or whole cells transformed with gene(s) encoding 

20 halohydrin dehalogenase(s). Whole cells transformed with halohydrin dehalogenase 

encoding genes and/or cell extracts and/or cell lysates thereof may be employed in a variety 
of different forms, including solid (e.g., lyophilized, spray dried, and the like) or semi-solid 
(e.g., a crude paste). The cell extracts or cell lysates may be partially purified by 
precipitation (ammonium sulfate, polyethyleneimine, heat treatment or the like), followed by 

25 a desalting procedure prior to lyophilization (e.g., ultrafiltration, dialysis, and the like). Any 
of the cell preparations may be stabilized by crosslinking using known crosslinking agents, 
such as, for example, glutaraldehyde or immobilization to a solid phase (e.g., Eupergit C, and 
the like).' 

The solid reactants (e.g., enzyme, salts, etc.) may be provided in a variety of different 
30 forms, including powder (e.g., lyophilized, spray dried, and the like), solution, emulsion, 
suspension, and the like. The reactants can be readily lyophilized or spray dried using 
methods and equipment that are known to those having ordinary skill in the art. For example, 
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the protein solution can be frozen at -S0°C in small aliquots, then added to a prechilled 
lyophilization chamber, followed by the application of a vacuum. After the removal of water 
from the samples, the temperature is typically raised to 4°C for two hours before release of 
the vacuum and retrieval of the lyophilized samples. 

In carrying out the conversion of 4-halo-3-hydroxybutyric acid ester or amide 
substrate to the corresponding 4-nucleophile substituted-3-hydroxybutyric ester or amide 
product, the substrate is typically contacted with the halohydrin dehalogenase and 
nucleopbile in a solvent. Suitable solvents for carrying out the conversion of 4-halo-3- 
hydroxybutyric acid ester or amide to 4-nucleophile substituted-3-hydroxybutyric acid ester 
or amide include water, organic solvents (e.g. ethyl acetate, butyl acetate, 1-octanol, heptane, 
" octane, methyl t-butyl ether (MTBE), toluene, and the like), ionic liquids (e.g., 1 -ethyl 4- 
methylimidazolium tetrafluoroborate, l-butyl-3-methylimidazoliumtetrafluoroborate, 1- 
butyl-3-methylimidazolium hexafluorophosphate, and the like), and co-solvent systems, 
including aqueous co-solvent systems, and the like. Preferred solvents are aqueous solvents, 
including water and aqueous co-solvent systems. 

Exemplary aqueous co-solvent systems have water and one or more organic solvent. 
In general, an organic solvent component of an aqueous co-solvent system is selected such 
that it does not completely inactivate the enzyme catalysts employed in the invention method. 
Appropriate co-solvent systems can be readily identified by measuring enzyme activity with 
the substrate of interest in the candidate solvent system, utilizing the enzyme assay described 
in Example 4. 

The organic solvent component of an aqueous co-solvent system may be miscible 
with the aqueous component, providing a single liquid phase, or may be partly miscible or 
immiscible with the aqueous component, providing two liquid phases. Typically, when an 
aqueous co-solvent system is employed, it is selected to be biphasic, with water dispersed in 
an organic solvent, or vice-versa. Generally, when an aqueous co-solvent system is utilized, 
it is desirable to select an organic solvent that can be readily separated from the aqueous 
phase. In general, the ratio of water to organic solvent in the co-solvent system is typically in 
the range of from about 90 : 1 0 to about 1 0:90 (v/v) organic solvent to water, and between • 
30:20 and 20:80 (v/v) organic solvent to water. The co-solvent system may be pre-formed 
prior to addition to the reaction mixture, or it may be formed in situ in the reaction vessel. 
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The aqueous solvent (water or aqueous co-solvent system) may be pH-buffered or 
unbuffered. The conversion of the 4-halo-3-hydroxybutyric acid ester or amide to the 
4-nucleophile substituted-3-hydrbxybutyric acid ester or amide may be carried out at a pH of 
about 5 or above. Generally, the conversion is carried out at a pH of about 10 or below, 
5 usually in the range of from about 5 to about 10.. Typically, the conversion is carried out at a 
pH of about 9 or below, usually in the range of from about 5 to about 9. Preferably, the 
conversion is carried out at a pH of about 8 or below, usually in the range of from about 5 to 
about 8, and more preferably in the range of from about 6 to about S. This conversion may 
also be carried out at a pH of about 7.8 or below, or 7.5 or below. Alternatively, the 
1 0 conversion may be carried out a neutral pH, i.e., about 7. 

During the course of conversion, the pH of the reaction mixture may change. The pH 
of the reaction mixture may be maintained at a desired pH or within a desired pH range by 
the addition of an acid or a base during the course of conversion. Alternatively, the pH 
change may be controlled by using an aqueous solvent that comprises a buffer. Suitable 
1 5 buffers to maintain desired pH ranges are known in the art and include, for example, 

phosphate buffer, triethanolamine buffer, and the like. Combinations of buffering and acid or 
base addition may also be used. 

As described above, when conversion to 4-cyano-3-hydroxybutyric acid derivative is 
desired, the cyanide may be provided in the form of a cyanide salt, typically an alkali salt (for 
20 example, NaCN, KCN, and the like), in the form of hydrocyanic acid (gaseous or in 

solution), or mixtures thereof. Hydrocyanic acid is a weak acid. In aqueous solutions within 
several pH units of its pKa (pKa = 9.1 in water) cyanide is present as both CN~ and HCN in 
equilibrium concentrations. At pH values below about 9, cyanide is predominantly present as 
HCN. 

25 When the cyanide is provided by a cyanide salt, the reaction mixture is typically 

buffered or acidified or both to provide the desired pH. Suitable acids for acidification of 
basic cyanide salts solutions include organic acids, for example carboxylic acids, sulfonic 
acids, phosphonic acids, and the like, mineral acids, for example hydrohalic acids (such as 
hydrochloric acid), sulfuric acid, phosphoric acid, and the like, acidic salts, for example 

30 dihydrogenphosphate salts (e.g. KH 2 P0 4 ), bisulfate salts (e.g. NaHS0 4 ) and the like, as well • 
as hydrocyanic acid. The acids or acid salts used to acidify the cyanide salt may be selected 
to also provide a buffer in the resulting solution. For example, acidification with phosphoric 
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acid or a dihydrogenphosphate salt may be used to provide a phosphate buffered solution of 
HCN in the phosphate buffer range (about pH 6-8). 

When the cyanide is provided by hydrocyanic acid and a higher pH than that so 
created is desired, the reaction mixture is typically buffered or made less acidic by adding a 
base, to provide the desired pH. Suitable bases for neutralization of hydrocyanic acid are 
organic bases, for example amines, alkoxides and the like, and inorganic bases, for example, 
hydroxide salts (e.g. NaOH), carbonate salts (e.g. NaHC0 3 ), bicarbonate salts (e.g. K 2 C0 3 ),' 
basic phosphate salts (e.g. K 2 HP0 4 , Na 3 P0 4 ), and the like, as well as cyanide salts. 

For pH values below about 9, at which cyanide is predominantly present as HCN, 
equation (1) describes the halohydrin dehalogenase catalyzed reaction of a 4-halo- 
3-hydroxybutyric acid ester with the HCN in unbuffered aqueous reaction mixtures. 

OH O HHDH OHO 
X^A>- OR * HCN — ~ NC ^A QR + < H++)r > ■ 

The consumption of the hydrocyanic acid, a weak acid (pKa ~9) and release of the hydrohalic 
acid, a strong acid (pKa <0), causes the pH of the reaction mixture to drop if the aqueous 
hydrohalic acid (H + + JT) is not otherwise neutralized. The pH of the reaction mixture may 
be maintained at the desired level by standard buffering techniques, wherein the buffer 
neutralizes the hydrohalic acid up to the buffering capacity provided, or by the addition of a 
base concurrent with the course of the conversion. Such addition may be done manually 
while monitoring the reaction mixture pH or, more conveniently, by using an automatic 
.titrator as a pH stat. A combination of partial buffering capacity and base addition can also 
be used for process control. 

When the pH is maintained by buffering or by addition of a base over the course of 
the conversion, an aqueous halide salt rather than aqueous hydrohalic acid is the product of 
the overall process. For example, equation (2) represents the overall process when aqueous 
sodium hydroxide (Na + + OH") is added over the course of the reaction to maintain an initial 
pH below about 9. 
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OH O HHDH OH O 

X^AA 0R + HCN + (Na + + OH") — NC^A^A QR + (Na + + X") + H 2 0 

(2) 

In the embodiment wherein a cyanide salt is added as the base to neutralize the hydrohalic 
acid as it is produced, the neutralization regenerates HCN and maintains the total cyanide 
5 concentration (HCN + CN~) as well as the pH in the reaction mixture. This can be 
advantageous if the rate of conversion otherwise decreases as cyanide concentration 
decreases. For example, equation (3) represents the overall process when aqueous sodium 
cyanide (Na* + CN~~) is added over the course of the reaction to maintain an initial pH. While 
the cyanide is present predominantly as HCN in the reaction mixture, the HCN concentration 
1 0 is maintained while the conversion in net consumes the added basic cyanide salt. 



OH O HHDH OH O 

*^AA 0R * (Na* ♦ CM") — J- NC^AA 0R ♦ <Na* ♦ >0 



When base addition is employed to neutralize the hydrohalic acid released during the 
1 5 halohydrin dehalogenase-catalyzed reaction of a 4-halo-3-hydroxybutyrate ester or amide to a 
4-cyano-3-hydroxybutyric acid ester or amide, the progress of the conversion may be 
monitored by the amount of base added to maintain the pH. Typically bases added to 
unbuffered or partially buffered reaction mixtures over the course of conversion are added in 
aqueous solutions. 

20 When the nucleophile is the conjugate anion of a stronger acid, having a pKa 

significantly below the initial pH of the reaction solution, the nucleophile is present 
predominantly in its anionic form so that, unlike with HCN, a proton is not released on its 
reaction. Accordingly, the reaction mixture pH in reactions of such nucleophiles may be 
maintained without stoichiometric buffering or base addition. For example, the conjugate 

25 acid of azide, hydrazoic acid has a pKa of 4.7 and the conjugate acid of nitrite, nitrous acid, 
has a pKa of 3.3. Accordingly, at neutral pH, these nucleophiles are present predominantly in 
their anionic form, and ONO", respectively. That is, the neutral reaction mixture 
comprises aqueous azide and nitrite salt, respectively. Their reaction in such mixtures 
releases halide anion to form aqueous halide salt, not aqueous hydrohalic acid. 
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Those having ordinary skill in the art can readily determine the quantities of HHDH, 
4-halo-3-hydroxybutyric acid ester or amide substrate and nucleophile to use based on, for 
example, the activity of HHDH as determined by the method in Example 4, the quantity of 
product desired, and the like. To illustrate, the amount of 4-halo-3-hydroxybutyric acid ester 
or amide can be in the range of from about 10 to about 500 g/L using about 10 mg to about 
30 g of halohydrin dehalogenase. The stoichiometric amount of nucleophile can be readily 
determined. Further illustrative examples are provided herein. 

Suitable conditions for carrying out the HHDH-catalyzed conversion of the present 
invention include a wide variety of conditions which can be readily optimized by routine 
experimentation that includes contacting the HHDH, 4-halo-3-hydroxybutyric acid ester or 
amide substrate, and nucleophile at an experimental pH and temperature and detecting 
product, for example, using the methods described in the Examples provided herein. The 
HHDH-catalyzed conversion of 4-halo-3-hydroxybutyric acid ester or amide to 4-nucleophile 
substituted-3-hydroxybutyric acid ester or amide is typically carried out at a temperature in 
the range of from about 15°C to about 75°C. More typically, the reaction is carried out at a 
temperature in the range of from about 20°C to about 55°C, and typically from about 20°C to 
about 45 °C. The reaction may also be carried out under ambient conditions. 

The HHDH-catalyzed conversion of 4-halo-3-hydroxybutyric acid ester or amide to 
4-nucleophile substimted-3-hydroxybutyric acid ester or amide is generally allowed to 
proceed until essentially complete or near complete conversion of substrate. Conversion of 
substrate to product can be monitored using known methods by detecting substrate and/or 
product. Suitable methods include gas chromatography, HPLC, and the like. Yields of the 
4-nucleophile substiruted-3-hydroxybutyric acid ester or amide generated in the reaction 
mixture are generally greater than about 50%, may also be greater tiian about 60%, may also 
be greater than about 70%, may be also be greater than about S0%, and are often greater than 
about 90%. 

The 4-nucleophile substituted-3-hydroxybutyric acid ester or amide may be collected 
from the reaction mixture and optionally purified using methods that are known to those 
having ordinary skill in the art, as well as those described in the Examples. 

Preferred 4-halo-3-hydroxybutyric acid ester or amide substrates of the present 
invention are chiral, being stereogenic at the 3-position, and may be racemic or non-racemic. 
Certain halohydrin dehalogenase enzymes used in the process of the present invention 
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convert the chiral substrate to the 4-cyano-3-hydroxybutyric acid ester or amide with 
retention of the absolute stereochemistry at the stereogenic 3-position. Non-racemic chiral 
4-halo-3-hydroxybutyric acid ester or amide substrates may be converted to substantially 
equally non-racemic 4-cyano-3-hydroxybutyric acid ester or amide products with little or no 
5 loss in stereopurity. The Examples show embodiments of the invention providing high 
retention of enantiopurity. (Due to conventions for designating stereochemistry, the 
enantiomer of ethyl 4-chloro-3-hydroxybutyrate designated as (S) and the enantiomer ethyl 
4-cyano-3-hydroxybutyrate designated as (R) have the identical stereoconfiguration at the 

3- position.) . 

10- In other embodiments of the present invention, certain halohydrin dehalogenase 
enzymes maybe stereospecific for one stereoisomer of the chiral 4-halo-3-hydroxybutyric 
acid ester or amide substrate. The process of the present invention using such stereospecific 
enzymes may be used to react one stereoisomer of a stereoisomeric mixture of a 4-halo-3- 
hydroxybutyric acid ester or amide, for example a racemic mixture, while leaving the other 

1 5 stereoisomer substantially unreacted, thereby providing a kinetic resolution of the mixture. 

A further significant characteristic of the present invention is that the purity of the 

4- nucleophile substiftited-3-hydroxybutyric acid ester or amide products generated is very 
high without the need for extensive purification procedures such as vacuum distillation. 
Typically, the purity of 4-nucleophile substituted-3-hydroxybutyric acid ester or amide 

20 products generated in accordance with the methods of the present invention are at least about 
80%, usually at least about 90%, and typically at least about 95%. Product purity may be 
determined by conventional methods such as HPLC or gas chromatography. 

25 II. KETOREDUCTASE-CATALYZED PRODUCTION OF HALOHYDRINS 

The present invention further provides an enzymatic method for generating a 4-halo- 
3-hydroxybutyric acid ester or amide by: 

(a) providing a 4-halo-3-ketobutyric acid ester or amide, 

wherein the halo substituent is selected from the group consisting of chlorine, 
30 bromine, and iodine; and 

(b) contacting the 4-halo-3-ketobutyric acid ester or amide with a ketoreductase, a 
cofactor, and a cofactor regeneration system under conditions suitable to form a reaction 
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mixture for converting the 4-halo-3-ketobutyric acid ester or amide to the 4-halo- 
3-hydroxybutyric acid ester or amide. 

The terms "ketoreductase" and "KRED" are used interchangeably herein to refer to an 
enzyme that, in the process of the present invention, catalyzes the reduction of a 4-halo- 
5 3-ketobutyric acid ester or amide to the corresponding 4-haIo-3-hydroxybutyric acid ester or 
amide. Such catalytic activity may be detected in an assay such as that described in Example 
4, hereinbelow. 

As used herein, the term "cofactor" refers to a non-protein compound that operates in 
combination with an enzyme which catalyzes the reaction of interest. Suitable cofactors 

1 0 employed in the practice of the present invention include NADP + (nicotinamide-adenine 
dinucleotide phosphate), NADPH (i.e., the reduced form of nicotinamide adenine 
dinucleotide phosphate), NAD 4 " (i.e., nicotinamide adenine dinucleotide), and NADH (i.e., 
the reduced form of NAD*), and the like. The reduced foim of the cofactor is regenerated by 
reducing the oxidized cofactor with a cofactor regeneration system. 

15 In the present process, the ketoreductase catalyzes the reduction of the 4-halo- 

3-ketobutyric acid ester or amide by the reduced form of the cofactor. Equation (4) describes 
the ketoreductase-cataiyzed reduction of a.4-halo-3-ketobutyric acid ester by NADH or 
NADPH, which are represented as alternatives by the designation NAD(P)H. 

O O OH O 

>0 x^A 0R+nad(p)h + h + is™ x^A OR+NAD(Pr 

(4) 

Ketoreductases that are suitable for carrying out the reduction of 4-halo-3-ketobutyric 
acid ester or amide to 4-halo-3-hydroxybutyric acid ester or amide include both naturally 
occurring ketoreductases, as well as non-naturally occurring ketoreductases generated by 
15 human manipulation Exemplary naturally occurring and non-naturally occurring 

ketoreductases and ketoreductase-encoding polynucleotides include those described herein. 

Naturally occurring KRED enzymes can be found in a wide range of bacteria and 
yeasts. Several naturally occurring KRED gene and enzyme sequences have been reported 
in the literature, such as, Candida magnoliae (Genbank Acc. No. JC7338; GLi 136053 S), 
50 Candida parapsilosis (Genbank Ac. No. BAA2452S.1; GL2815409), Sporobolomyces 
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salmicolor (Genbank Acc. No. AF1 60799; GI 6539734). Polynucleotide sequences encoding 
the ketoreductase from Candida magnoliae are provided as SEQ ID NOS: 1 (CR2-5), 3 
(CR1-2), 5 (CR1-3), and 7 (CR2-4). SEQ ID NOS: 1 (CR2-5), 5 (CR1-3), and 7 (CR2-4) are 
variants that encode the C magnoliae protein (SEQ ED NOS: 2, 6, and S). SEQ ID NO: 3 
5 (CR1-2) encodes a variant that differs from the C. magnoliae protein by one amino acid 
change (SEQ ID NO: 4). Enzymatic reduction of |3-keto esters has been reported for a 
carbonyl reductase from Rhodococcus eiythropolis (Peters, AppI. Microbiol. BiotechnoL 
(1992) 38:334-340; Zelinski, J. BiotechnoL (1994) 33:283-292), an aldehyde reductase from 
Sporoboromyces salmonicolor AKU 4429 (Shimizu, BiotechnoL Lett. (1990) 12:593-596; 

1 0 Appl. Enviro n. Microbiol . (1:990) 56:2374-2377). Enzymes such as those derived from S. 
cerevisiae (J. Org. Ghem. (1991) 56:4778; Biosci. Biotech. Biochem . (1994) 58:2236), 
Sporobolomyces salmonicolor (Biochim. Biophvs. Acta (1992) 1 122:57), Sporobolomyces 
$P- (Biosci. Biotech. Biochem. (1993) 57:303; Japanese patent publication JP2566960), 
Candida albicans fBiosci. Biotech. Biochem. (1993) 57:303), Candida macedoniensis (Arch. 

1 5 Biochem. Biophvs. ( 1 992) 294-469), Geofrichium candidum (Enzyme Microbiol. TechnoL 
(1 992) 14:731) have been used for the reduction of ethyl 4-chloro-3-acetoacetate (ECAA). 
U.S. Pat. No. 6,168,935 describes the use of glycerol dehydrogenase (Tetrahedron Lett. 
(1988) 29:2453), alcohol dehydrogenase (ADH) from Thermoanaerobhtm brockii (JACS 
(1985) 107:4028), or Sulfolobus solfataricus (BiotechnoL Lett. (199n 13:31) or 

20 Pseudomonas sp. (U.S. Pat. No, 5,385,833; J. Org; Chem. (1992) 57:1526). 

Suitable non-naturally occurring ketoreductases can be readily identified by applying 
known methods, including mutagenesis, directed evolution, and the like, followed by 
screening for activity using the method described in Example 4. For example, these methods 
can be readily applied to naturally occurring ketoreductases, including the ones described 

25 herein. Exemplary non-naturally occurring ketoreductases are provided herein as SEQ ID 
NOS: 40 (KRED krhl33c) ? 42 (KRED krh215), 44 (KRED krh267), 46 (KRED krh2S7), 48 
(KRED krh320), 50 (KRED krh326), 52 (KRED krh408), 54 (KRED krh417), 56 (KRED 
krh4S3), 58 (KRED krh476), 60 (KRED krh495), 1 14 (KRED S01040430), 1 16 (KRED 
S01091361), 118 (KRED S01091625), and 120 (KRED S01094648). The polynucleotide 

30 sequences that encode them are provided herein as SEQ ID NOS: 39, 41 , 43, 45, 47, 49, 5 1 , 
53, 55, 57, 59, 1 13, 1 15, 1 17, and 1 19, respectively. Additional non-naturally occurring 
ketoreductases that are suitable for use in the practice of the present invention are provided in 
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the patent application entitled, "Improved Ketoreductase Polypeptides and Related 
Polynucleotides," corresponding to Attorney Docket No. 0190.1 10US/15077US01, filed on 
August 11, 2003, and assigned U.S. application serial number 60/494,195, and in the patent 
application entitled, "Improved Ketoreductase Polypeptides and Related Polynucleotides," 
corresponding to Attorney Docket No. 0190.210US/15077US02, both of which are 
incorporated herein by reference in their entireties. 

Ketoreductases employed in the practice of the present invention typically exhibit an 
activity of at least about 1 umol/min/mg in the assay described in Example 4, using the 
4-halo-3-ketobutyric acid ester or amide substrate of interest. Ketoreductases employed in 
the practice of the present invention may exhibit an activity of at least 1 umol/min/mg to 
about 10 umol/min/mg and sometimes at least about 10 2 umol/min/mg, up to about 10 3 
umol/min/mg or higher. 

4-halo-3-ketobutyric acid esters and amides employed in the practice of the present 
invention can be readily purchased or synthesized using known methods. Exemplary 4-halo- 
1 5 3-ketobutyric acid ester substrates include those having the structure IV: 



10 



O 

R 2 II R 3 




OR 1 



,5. 



rv 



where: 

X is a halogen selected from the group consisting of chlorine, bromine, and iodine; 

20 and 

R 1 , R 2 , R 3 , R 4 , and R 5 are selected as described for structure 1 A. 

Specific 4-halo-3-ketobutyric acid esters that may be employed in the practice of the 
present invention include ethyl 4-chloro-3-ketobutyric acid ester (i.e., where X is chlorine, 
R\ R 2 , R 3 , and R 4 are each hydrogen, and R 5 is ethyl), methyl 4-chloro-3-ketobutyric acid 
25 ester (i.e., where X is chlorine, R 1 , R 2 , R 3 , and R 4 are each hydrogen, and R 5 is methyl), ethyl 
4-bromo-3-ketobutyric acid ester (i.e., where X is bromine, R 1 , R 2 , R 3 , and R 4 are each 
hydrogen, and R 5 is ethyl), ethyl 4-iodo-3-ketobutyric acid ester (i.e., where X is iodine R 1 
R , R , and R are each hydrogen, and R 5 is ethyl), methyl 4-bromo-3-ketobutyric acid ester 
(i.e., where X is bromine, R 1 , R 2 , R 3 , and R 4 are each hydrogen, and R 5 is methyl), methyl 
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4-iodo-3-ketobutyric acid ester (i.e., where X is iodine, R 1 , R 2 , R 3 , and R 4 are each hydrogen, 
and R 5 is methyl), t-butyl-4-chloro-3-ketobutyric acid ester (i.e., where X is chlorine, R 1 , R 2 , 
R 3 , and R 4 are each hydrogen, and R 5 is t-butyl), t-butyl-4-bromo-3-ketobutyric acid ester 
(i.e., where X is bromine, R 1 , R 2 3 R 3 , and R 4 are each hydrogen, and R 5 is t-butyl), and t- 
5 butyl-4-iodo-3-ketobutyric acid ester (i.e., where X is iodine, R 1 , R 2 , R 3 , and R 4 are each 
hydrogen, and R 5 is t-butyl). In certain embodiments, at least one ofR'.R 2 ^ 3 , andR 4 is a 
lower alkyl, such as, for example, methyl, ethyl, or propyl. 

When 4-halo-3-ketobutyric acid ester substrates having the structure of compound IV 
are reduced during the KRED-catalyzed conversion of the present invention, 4-halo- 
10 3-hydroxybutyric acid esters are generated having the structure V: 




V 

where X, R 1 , R 2 , R 3 , R 5 and R 5 are as described for structure IV. 

4-halo-3-hydroxybutyric acid esters or amides produced by the ketoreductase- 
1 5 catalyzed reduction method of the present invention can then be readily used in the 

halohydrin dehalogenase-catalyzed conversions of the present invention. For example, 4- 
halo-3-hydroxybutyric acid esters corresponding to structure V can be used as substrate for 
conversion by HHDH in the presence of cyanide to generate 4-cyano-3-hydroxybutyric acid 
esters having the structure VI: 




VI 



where R l , R 2 , R 3 , R 4 , and R 5 are as described as for compound V. 

The term "cofactor regeneration system" refers herein to a set of reactants that 
participate in a reaction that reduces the oxidized form of the cofactor (e.g., NADP to 
25 NADPH). Cofactors oxidized by the ketoreductase-catalyzed reduction of the 4-halo-3- 

ketobutyric acid ester or amide are regenerated in reduced form by the cofactor regeneration 
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system. Cofactor regeneration systems comprise a stoichiometric reductant that is a source of 
reducing hydrogen equivalents and is capable of reducing the oxidized form of the cofactor. 
The cofactor regeneration system may further comprise a catalyst, for example an enzyme 
catalyst, that catalyzes the reduction of the oxidized form of the cofactor by the reductant. 
5 Cofactor regeneration systems to regenerate NADH or NADPH from NAD or NADP, 
respectively, are known in the art and may be used in the present invention. 

. Suitable cofactor regeneration systems employed in the practice of the present 
invention include glucose and glucose dehydrogenase, formate and formate dehydrogenase, 
glucose-6-phosphate and glucose-6-phosphate dehydrogenase, isopropyl alcohol and 

1 0 secondary alcohol dehydrogenase, phosphite and phosphite dehydrogenase,molecular 
hydrogen and hydrogenase, and the like, and may be used in combination with either 
NADP/NADPH or NAD/NADH as the cofactor. Electrochemical regeneration using 
hydrogenase may also be used as a cofactor regeneration system. See, e.g., U.S. Patent Nos. 
5,538,867 and 6,495,023, both of which are incorporated herein by reference. Chemical 

1 5 cofactor regeneration systems comprising a metal catalyst and a reducing agent (for example, 
. molecular hydrogen or formate) are also suitable. See, e.g., PCT publication WO • 
200005373 1 , which is incorporated herein by reference. 

The terms "glucose dehydrogenase" and "GDH" are used interchangeably herein to 
refer to an NAD or NADP-dependent enzyme that catalyzes the conversion of D-glucose and 

20 NAD or NADP to gluconic acid and NADH or NADPH, respectively. Equation (5) describes 
the glucose dehydrogenase-catalyzed reduction of NAD or NADP by glucose. 



glucose + NAD(P) + + H 2 0 gluconic acid + NAD(P)H + H + 



(5) 



25 Glucose dehydrogenases that are suitable for use in the practice of the present 

invention include both naturally occurring glucose dehydrogenases, as well as non-naturally 
occurring glucose dehydrogenases. Naturally occurring glucose dehydrogenase encoding 
genes have been reported in the literature. For example, the Bacillus subtilis 61297 GDH 
gene was expressed in E. coli and was reported to exhibit the same physicochemical 

30 properties as the enzyme produced in its native host (Vasantha, et al., Proc, Natl. Acad. Sci. 
USA (1983) 80:785). The gene sequence of the B. subtilis GDH gene, which corresponds to 
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Genbank Acc. No. M12276, was reported by Lampel, et al. ( 7. Bacterid: (1986) 166:238- 
243) and in corrected form by Yamane, et al. ("Microbiology (1996) 142:3047-3056) as 
Genbank Acc. No. D50453. Naturally occurring GDH genes also include those that encode 
the GDH from B. cereus ATCC 14579 (Nature (2003) 423:87-91; Genbank Acc. No. 
5 AE017013 ) and£. megaterium f Eur. J. Biochem. (1988) 174:485-490, Genbank Acc. No. 
X12370; J. Feiment. Bioeng. (1990) 70:363-369, Genbank Acc. No. GI216270). Glucose 
dehydrogenases from Bacillus sp. are provided herein as SEQ IE) NOS: 10 and 12 (encoded 
by polynucleotide sequences corresponding to SEQ ID NOS: 9 and 1 1, respectively). 

Non-naturally occurring glucose' dehydrogenases may be generated using known 

10 methods, such as, for example, mutagenesis, directed evolution, and the like. GDH enzymes 
having suitable activity, whether naturally occurring or non-naturally occurring, may be 
readily identified using the assay described in Example 4. Exemplary non-naturally 
. occurring halohydrin dehalogenases are provided herein as SEQ ID NOS: 62 (GDH 2313), 64 
(GDH 2331), 66 (GDH 2279), 68 (GDH 2379), 122 (GDH S01024744), 124 (GDH 

15 SO 1052992), and 126 (GDH SO 10637 14). The polynucleotide sequences that encode them 
are provided herein as SEQ ID NOS: 61, 63, 65, 67, 121, 123, and 125, respectively. 
Additional non-naturally occurring glucose dehydrogenases that are suitable for use in the 
practice of the present invention are provided in the patent application entitled, "Improved 
Glucose Dehydrogenase Polypeptides and Related Polynucleotides," corresponding to 

20 Attorney Docket No. 0352.1 10US/15076US01, filed on August 11, 2003, and assigned U.S. 
application serial number 60/494,300, and in the patent application entitled, "Improved 
Glucose Dehydrogenase Polypeptides and Related Polynucleotides," corresponding to 
Attorney Docket No. 0352,2 10US/15076US02, filed on February 18, 2004, and assigned U.S. 
application serial number , both of which are incorporated 

25 herein by reference in their entireties. 

Glucose dehydrogenases employed in the practice of the present invention may 
exhibit an activity of at least about 10 pnol/min/mg and sometimes at least about 10 2 
jLimol/min/mg or about 1 0 3 (imol/min/mg, up to about 10 4 prnol/min/mg or higher in the assay 
described in Example 4. 

30 When glucose and glucose dehydrogenase are employed as the cofactor regeneration 

system, as the 4-halo-3-ketobutyric acid ester or amide is reduced by the ketoreductase and 
NADH or NADPH, the resulting NAD or NADP is reduced by the coupled oxidation of 
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10 



15 



glucose to gluconic acid by the glucose dehydrogenase. The net reaction is described by 
equation (6), which is the summation of equations (4) and (5): 



O O 



OH O 

X - JJ ^ A OR + glucose X ^A)R + 9luconicacid 

NAD(P)H 



(6) 



The ketoreductase-catalyzed reduction of 4-halo-3-ketobutyric acid ester or amide is 
generally carried out in a solvent. The solvent may be a co-solvent system, such as, for 
example, an aqueous co-solvent system. Suitable solvents (including co-solvent systems) for 
carrying out this conversion are the same as those described above for the HHDH-catalyzed 
conversion of 4-halo-3-hydroxybutyric acid esters and amides to 4-cyano-3-hydroxybutyric 
acid esters and amides. 

The aqueous solvent (water or aqueous co-solvent system) may be pH-buffered or 
unbuffered. The conversion of the 4-halo-3-ketobutyric acid ester or amide to the 4-haIo- 
3-hydroxybutyric acid ester or amide may be carried out at a pH of about 5 or above. 
Generally, the conversion is carried out at a pH of about 10 or below, usually in the range of 
from about 5 to about 1 0. Typically, the conversion is carried out at a pH of about 9 or 
below, usually in the range of from about 5 to about 9. Preferably, the conversion is carried 
out at a pH of about S or below, usually in the range of from about 5 to about S, and more 
preferably in the range of from about 6 to about 8. Alternatively, the conversion may be 
20 carried out at neutral pH, i.e., about 7. 

When the glucose/glucose dehydrogenase cefaclor regeneration system is employed, 
. the co-production of gluconic acid (pKa = 3.6), as represented in equation (6) causes the pH 
of the reaction mixture to drop if the resulting aqueous gluconic acid is not otherwise 
neutralized. The pH of the reaction mixture may be maintained at the desired level by 
25 standard buffering techniques, wherein the buffer neutralizes the gluconic acid up to the 
buffering capacity provided, or by the addition of a base concurrent with the course of the 
conversion. Suitable buffers and procedures for buffering and suitable bases and procedures 
for the addition of base during the course of the conversion are the same as those described 
above for the HHDH-catalyzed conversion of 4-halo-3-hydroxybutyrate esters and amides to 
30 4-cyano-3-hydroxybuyrate esters and amides. 
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In the ketoreductase-catalyzed reduction of the 4-halo-3-ketobutyric acid ester or 
amide using glucose/glucose dehydrogenase for cofactor regeneration, when the pH is 
maintained by buffering or by addition of a base over the course of the conversion, an 
aqueous gluconate salt rather than aqueous gluconic acid is the product of the overall process. 
For example, equation (7) represents the overall process when aqueous sodium hydroxide 
(Na + + OH") is added over the course of the reaction to maintain the pH: 

0 0 OH. O 

X — il ^OR + 9 |ucose + ( Na++ OH") JS*j^ X^A^A 0R + (Na + + gluconate! 

NAD(P)H (?) 

When base addition is employed to neutralize the gluconic acid released during the 
ketoreductase-catalyzed reduction of a 4-halo-3-ketobutyric acid ester or amide using the 
glucose/glucose dehydrogenase cofactor regeneration system, the progress of the conversion 
may be monitored by the amount of base added to maintain the pH. Typically bases added to 
unbuffered or partially buffered reaction mixtures over the course of conversion are added in 
aqueous solutions. 

The terms "formate dehydrogenase" and "FDH" are used interchangeably herein to 
refer to an NAD or NADP-dependent enzyme that catalyzes the conversion of formate and 
NAD or NADP to carbon dioxide and NADH or NADPH, respectively. Formate 
dehydrogenases that are suitable for use in the practice of the present invention include both 
naturally occurring formate dehydrogenases, as well as non-naturally. occurring formate 
dehydrogenases. Formate dehydrogenases include those corresponding to SEQ ID NOS: 70 
(Pseudomonas sp.) and 72 (Candida boidimi), which are encoded by polynucleotide 
sequences corresponding to SEQ ID NOS: 69 and 71, respectively. Formate dehydrogenases 
employed in the practice of the present invention, whether naturally occurring or non- 
naturally occurring, may exhibit an activity of at least about 1 jxmol/min/mg, sometimes at 
least about 10 |imol/min/mg, or at least about 10 2 [imol/min/mg, up to about 10 3 
(amol/min/mg or higher, and can be readily screened for activity in the assay described in 
Example 4. 

As used herein, the term "formate" refers to formate anion (HCOf), formic acid 
(HC0 2 H), and mixtures thereof. Formate may be provided in the form of a salt, typically an 
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alkali or ammonium salt (for example, HC0 2 Na, KHC0 2 NH 4 , and the like), in the form of 
formic acid, typically aqueous formic acid, or mixtures thereof. Fonnic acid is a moderate 
acid. In aqueous solutions within several pH units of its pKa (pKa = 3.7 in water) formate is 
present as both HCCV and HC0 2 H in equilibrium concentrations. At pH values above about 
5 4, formate is predominantly present as HC0 2 ~ When formate is provided as formic acid, the 
reaction mixture is typically buffered or made less acidic by adding a base to provide the 
desired pH, typically of about 5 or above. Suitable bases for neutralization of formic acid are 
as described for neutralization of hydrocyanic acid, above. 

For pH values above about 5, at which fonnate is predominantly present as HC0 2 ~ 
10 equation (8) describes the fonnate dehydrogenase-catalyzed reduction of NAD or NADP by 
fonnate. 



FDH 

HC0 2 " + NAD(P)+ —~ C0 2 + NADP(H) 



(8) 



When fonnate and formate dehydrogenase are employed as the cofactor regeneration 
15 system, as the 4-halo-3-ketobutyric acid ester or amide is reduced by the ketoreductase and 
NADH or NADPH, the resulting NAD or NADP is reduced by the coupled oxidation of 
formate to carbon dioxide by the formate dehydrogenase. The net reaction is described by 
equation (9), which is the summation of equations (4) and (8): 

x^A 0R ♦ hco 2 - ♦ h* -mat xXa or+ co 2 

20 NAD(P)H (9) 

Equation (9) shows that when the formate/formate dehydrogenase cofactor 
regeneration system is employed for the reduction of the 4-halo-3-ketobutyric acid ester or 
amide in aqueous solution with pH above about 5, protons in solution are consumed and the 
reaction causes the pH of the reaction mixture to rise if it is not otherwise buffered or re- 

25 • acidified. The pH of the reaction mixture may be maintained at the desired level by standard 
buffering techniques, wherein the buffer releases protons up to the buffering capacity 
provided, or by the addition of an acid concurrent with the course of the conversion. Suitable 
acids to add during the course of the reaction to maintain the pH include organic acids, for 
example carboxylic acids, sulfonic acids, phosphonic acids, and the like, mineral acids, for 
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example hydrohalic acids (such as hydrochloric acid), sulfuric acid, phosphoric acid, and the 
like, acidic salts, for example dihydrogenphosphate salts (e.g. KH 2 P0 4 ), bisulfate salts (e.g. 
NaHS0 4 ) and the like. Particularly preferred is formic acid, whereby both the formate 
concentration and the pH of the solution are maintained. For example, equation (10) 
5 represents the overall process when formic acid (HC0 2 H) is added over the course of the 
reaction to maintain an initial pH above about 5. While the formate is present predominantly 
as HC0 2 " in the reaction mixture, the HC0 2 " concentration is maintained while the 
conversion in net consumes the added formic acid. ' 

xXX OR+ HCO 2 H-^ x Xi 0R + cO 2 . 

NAD(P)H 

HC0 2 - (i0) 

When acid addition is employed to maintain the pH during the ketoreductase- 
catalyzed reduction of a 4-halo-3-ketobutyric acid ester or amide using the formate/formate 
dehydrogenase cofactor regeneration system, the progress of the conversion may be 
1 5 monitored by the amount of acid added to maintain the pH. Typically, acids added to 

unbuffered or partially buffered reaction mixtures over the course of conversion are added in 
aqueous solutions. 

In carrying out the methods of the present invention, either the oxidized or reduced 
form of the cofactor may be provided initially. As described above, the cofactor 

20 regeneration system converts oxidized cofactor to its reduced form, which is then utilized in 
the reduction of the ketoreductase substrate (i.e., 4-halo-3-ketobutyric acid ester or amide) to 
the corresponding halohydrin. 

As with the halohydrin dehalogenases, the ketoreductase and enzymes of the cofactor 
regeneration system may be provided to the reaction mixture for converting 4-halo-3- 

25 ketobutyric acid ester or amide in the form of purified enzyme, cell extract, cell lysate, or 
whole cells transformed with gene(s) encoding the ketoreductase and enzymes of the cofactor 
regeneration system. The genes encoding the enzymes can be transformed into host cells 
either separately, or together into the same host cell. For example, in one embodiment one 
set of host cells can be transformed with ketoreductase encoding gene(s) and another set can 
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be transformed with cofactor regeneration system enzyme (e.g., GDH, FDH, and the like) 
encoding gene(s). Both sets of transformed cells can be utilized together in the reaction 
mixture in the form of whole cells or cell lysates or cell extract derived therefrom. 
Alternatively, a host cell can be transformed with genes encoding both ketoreductase and a 
cofactor regeneration system enzyme, such that each cell expresses both ketoreductase and 
the cofactor regeneration system enzyme. In a further embodiment, the host cell can be 
transformed with genes encoding ketoreductase, a cofactor regeneration system enzyme, and 
a halohydrin dehalogeriase. These cells can be utilized in the methods of the present 
invention to provide the enzymes in the form of whole cells, cell lysate, or cell extract. As 
described for the reaction mixture of the HHDH-catalyzed method, the solid reactants (i.e., 
enzymes, salts, cofactor regeneration system, cofactor, and the like) may be provided in a 
variety of different forms, including powder (e.g., lyophilized, spray dried, and the like), 
solution, emulsion, suspension, and the like. 

The quantities of reactants used in the reduction step will generally vary depending on 
the quantities of 4-halo-3-hydroxybutyric acid ester or amide desired, and concomitantly the 
amount of ketoreductase substrate employed. The following guidelines can be used to 
determine the amounts of ketoreductase, cofactor, and cofactor regeneration system to use. 
Generally, 4-halo-3-ketobutyric acid esters and amides are employed at a concentration of 
about 10 to 500 grams/liter using from about 10 mg to about 5 g of ketoreductase and about 
25 mg to about 5 g of cofactor. Those having ordinary skill in the art will readiiy understand 
how to vary these quantities to tailor them to the desired level of productivity and scale of 
production. Appropriate quantities of cofactor regeneration system may be readily 
determined by routine experimentation based on the amount of cofactor and/or ketoreductase 
utilized. In general, the reductant (e.g. glucose, formate) is utilized at levels above the 
equimolar level of ketoreductase substrate to achieve essentially complete or near complete 
conversion of the ketoreductase substrate. 

The order of addition of reactants is not critical. The reactants may be added together 
at the same time to a solvent (e.g., monophasic solvent, biphasic aqueous co-solvent system, 
and the like), or alternatively, some of the reactants may be added separately, and some 
together at different time points. For example, the cofactor regeneration system, cofactor, 
ketoreductase, and ketoreductase substrate may be added first to the solvent 
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For improved mixing efficiency when an aqueous co-solvent system is used, the 
cofactor regeneration system, ketoreductase, and cofactor are usually added and mixed into 
the aqueous phase first. The organic phase may then be added and mixed in, followed by 
addition of the ketoreductase substrate. Alternatively, the ketoreductase substrate may be 
premixed in the organic phase, prior to addition to the aqueous phase. 

As for the halohydrin dehalogenase-catalyzed conversion of 4-halo-3-hydroxybutyric 
acid esters and amides, suitable conditions for carrying out the ketoreductase-catalyzed 
reduction of 4-halo-3-ketobutyric acids esters and amides of the present invention include a 
wide variety of conditions that can be readily determined by those having ordinary skill in the 
art. Suitable temperatures for carrying out the ketoreductase-catalyzed reduction step are 
typically in the range of from about 15°C to about 75°C. Usually, the reactions are carried 
out at a temperature in the range of from about 20°C to about 55°C, and preferably from 
about 20°C to about 45°C. The reaction may also be carried out under ambient conditions, 

As in the halohydrin dehalogenase-catalyzed reaction, the ketoreductase-catalyzed 
reaction is allowed to proceed until essentially complete or near complete conversion of 
substrate is observed using methods that are known in the art. As in the halohydrin 
dehalogenase-catalyzed reaction, the progression of the ketoreductase-catalyzed reaction may 
be monitored by monitoring the amount of base or acid added to counter the pH change that 
may otherwise occur with the particular cofactor regeneration system that is used, as 
described above. 

The ketoreductase-catalyzed reduction of the 4-halo-3-ketobutyric acid ester or amide 
substrate generates a new stereogenic carbon at the 3-position of the 4-halo-3-hydroxybutyric 
acid ester or amide product. Typically, the 4-halo-3-hydroxybutyric acid ester or amide is 
generated with a relatively high stereoselectivity at the 3-position. Thus, the 4-halo- 
3-hydroxybutyric acid esters and amides generated by the ketoreductase-catalyzed reduction 
of 4-halo-3-ketobutyric acid esters and amides are typically chiral and non-racemic. The 
ketoreductase reactions used in the present invention typically generate preferred nonracemic, 
chiral 4-halo-3-hydroxybutyric acid esters having an e.e. of at least about 90% e.e., usually at 
least about 95% e.e., and typically at least about 99% e.e. The Examples illustrate 
embodiments providing ethyl (S)-4-chloro-3-hydroxybutyrate with, greater than 99% e.e. 

As used herein, the term "enantiomeric excess" or " e.e." refers to the absolute 
difference between the mole or weight fractions of major (F (+) ) and minor (F ( .)) enantiomers 
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(i.e., | F( +) - F w 1 ), where F (+) + F H = 1 . Percent e.e. is 100 X | F (+) - F H | . Enantiomeric 
composition can be readily characterized by using the gas chromatography method described 
in Example 6, hereinbelow, and using methods that are known in the art. 

As described above, when these nonracemic chiral 4-halo-3-hydroxybutyric acid 
esters or amides are used as substrates in the halohydrin dehalogenase-catalyzed reactions of 
the present invention, the resulting 4-substituted-4-hydroxybutyric acid esters or amides are 
substantially equally nonracemic, with little or no loss in stereopurity. The combination of 
the high stereoselectivity of the ketoreductase-catalyzed production' of the nonracemic 4-halo- 
3-hydroxybutyric acid esters or amides and the high stereofidelity of the halohydrin 
dehalogenase-catalyzed conversion of them to the corresponding nonracemic 4-cyano- 
3-hydroxybutyric acid esters or amides provides a particularly attractive inventive process for 
the overall production of nonracemic 4-cyano-3-hydroxybutyric acid esters or amides of high 
e.e. from 4-halo-3-ketobutyric acid esters or amides. 

A further significant characteristic of the present invention is that the yield of chiral 
products generated is very high. Typically, the yields of 4-halo-3-hydroxybutyric acid ester 
or amide and 4-nucleophile substituted-3-hydroxybutyric acid ester or amide products 
generated in accordance with the methods of the present invention are at least about 70%, 
usually at least about 80%, typically at least about 90%, and may be at least about 95%. The 
computation of product yield is based on initial substrate quantity provided and the amount of 
product formed in the reaction mixture. Product 4-halo-3-hydroxybutyric acid ester or amide 
may be optionally purified prior to contacting with the halohydrin dehalogenase. As used 
herein, the term "purified' 1 refers to a process in which a separation process is applied to a 
mixture, resulting in an increase in concentration of one component relative to other 
components in the mixture. Suitable purification processes employed in the practice of the 
present invention include, for example, filtration, solid or liquid phase extraction, distillation, 
and the like. 

If the 4-halo-3-hydroxybutyric acid ester or amide is purified from the ketoreductase 
reaction mixture, it is subsequently added to a solvent (e.g., a monophasic solvent, a biphasic 
aqueous co-solvent system) with the halohydrin dehalogenase and nucleophile. 
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III. ENZYMATIC CONVERSION OF 4-H ALO-3 -KETOB UT YRIC ACID 

ESTER/AMIDE TO 4-NUCLEOPHILE SUBSTITUTED-3-HYDRQXYBUTYRIC 
ACID ESTER/AMIDE IN A SINGLE REACTION VESSEL 
The present invention provides a method for carrying out the conversion of 4-halo- 
5 3-ketobutyric acid esters and amides to the corresponding 4-nucleophile substituted- 

3-hydroxybutyric acid esters and amides in a single reaction vessel, the method comprising 
contacting the 4-halo-3-ketobutyric acid ester or amide with a ketoreductase, a cofactor, a 
cofactor regeneration system, a nucleophile, arid a halohydrin dehalogenase to form a 
reaction mixture for converting the 4-halo-3-ketobutyric acid ester or amide to a 
1 0 4-nucleophile substituted-3-hydroxybutyric acid- ester or amide 

Mechanistically, this single-vessel method proceeds via ketoreductase-catalyzed 
conversion of the 4-halo-3-ketobutyric acid ester or amide to provide the 4-halo-3- 
hydroxybutyric acid ester or amide in situ, and consequent halohydrin dehalogenase- 
catalyzed conversion of the 4-halo-3-hydroxybutyric acid ester or amide to the corresponding 
1 5 4-nucleophile substituted-3-hydroxybutyric acid ester or amide. Significantly, the 4-halo- 
3-hydroxybutyric acid ester or amide produced by the ketoreductase-catalyzed reaction is not 
separated or recovered prior to its contact with halohydrin dehalogenase and nucleophile 
(e.g., cyanide and the like) for its conversion to 4-nucleophile substituted-3-hydroxybutyric 
acid ester or amide. 

20 Suitable reactants (substrates, enzymes, cofactors), solvents, pH, temperature, and 

other reaction conditions and procedures for the single-vessel conversion of 4-halo-3- 
ketobutyric acid ester or amide to 4-nucleophile substituted-3-hydroxybutyric acid ester or 
amide are the same as those described above for the carrying out the halohydrin . 
dehalogenase-catalyzed conversion of 4-halo-3-hydroxybutyric acid esters and amides to the 

25 corresponding 4-nucleophile substituted-3-hydroxybutyric acid esters and amides, . 

When glucose and glucose dehydrogenase are used as the cofactor regeneration 
system and two equivalents of base are added during the course of the reaction to neutralize 
both the gluconic acid and hydrohalic acid produced and maintain the initial pH of the . 
reaction mixture (for initial pHs in the range of about 5 to about 9), the overall process in a 

30 single-vessel reaction is described by equation (11), which is the summation of equations (2) 
and (7), wherein aqueous sodium hydroxide is illustrated as the base, 
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O O 

X nAA qr + glucose + HCN + 2(Na + + ^hn KRED ' GDH, NAD(P)H . 

' HHDH 

OH O 

NC^A^A QR + (Na + + gluconate") + (Na + + X") + H 2 0 ^ 

Other single-vessel overall process equations can result from summing equations 
describing other options for conducting the halohydrin dehalogenase-catalyzed reaction (e.g.. 
using a cyanide salt as the base) and/or the ketoreductase reaction (e.g. using formate and 
formate dehydrogenase as the cofactor regeneration system), as described above for the 
separately conducted reactions. 

It will also be understood that the same single-vessel result may be obtained by first 
conducting the ketoreductase reaction separately as described above, then subsequently 
adding halohydrin dehalogenase and cyanide into the ketoreductase reaction mixture and 
conducting the halohydrin dehalogenase reaction in the presence of the ketoreductase 
reaction components. 

An embodiment of a single-vessel process for converting a 4-haio-3-ketobutyric acid 
ester to a 4-cyano-3-hydroxybutyric acid ester is illustrated in Example 24. 

IV -. HALOHYDRIN DEHALOGENASE-C AT A T ,YZED CONVERSION OF VTCTNAT. 
HALO. HYD ROXY SUBSTITUTED CAR BOXYLIC A (TIT) ESTERS TO 
VICINAL CYANO. HYDROXY SUBS TITUTED CARBOXYLIC ACTD ESTERS 
In addition to 4-halo-3-hydroxybutyric acid ester substrates, it has been discovered 
that halohydrin dehalogenases can be used to catalyze the conversion of other vicinal halo, 
hydroxy substituted carboxylic acid ester substrates to their corresponding vicinal cyano, 
hydroxy substituted carboxylic acid esters, using the same conditions as described in part I. 
Thus, the present invention also provides a method for producing a vicinal cyano, hydroxy 
substituted carboxylic acid ester from a vicinal halo, hydroxy substituted carboxylic acid 
ester, the method comprising: 

(a) providing a vicinal halo, hydroxy substituted carboxylic acid ester, 

wherein the halo substituent is selected from the group consisting of 
chlorine, bromine, and iodine; and 
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(b) contacting the vicinal halo, hydroxy substituted carboxylic acid ester with a 
halohydrin dehalogenase and cyanide under conditions suitable to form a reaction mixture for 
converting the vicinal halo, hydroxy substituted carboxylic acid ester to a vicinal cyano, 
hydroxy substituted carboxylic acid ester. 
5 As used herein, the term "vicinal halo, hydroxy substituted" refers to halogen and 

hydroxyl group substitutions on adjacent carbons. The term "vicinal cyano, hydroxy 
substituted" is used herein to refer to cyano (ON) and hydroxyl groups substitutions on 
adjacent carbons. Vicinal halo, hydroxyl substituted carboxylic acid esters that are suitable 
for use in the practice of the present invention include those that are not 4-halo-3- 

10 hydroxybutyric acid ester. Concomitantly, vicinal cyano, hydroxy substituted carboxylic acid 
ester product generated by these methods include those that are not 4-eyano-3- 
hydroxybutyric acid ester. 

Preferred vicinal halo, hydroxy substituted carboxylic acid ester substrates of the 
present invention are chiral, being stereogenic at the hydroxy-substituted carbon, and may be 

1 5 racemic or non-racemic. In these embodiments, certain halohydrin dehalogenase enzymes 
used in the process of the present invention convert the chiral vicinal halo, hydroxyl 
substituted carboxylic acid ester substrate to the corresponding vicinal cyano, hydroxy 
substituted carboxylic acid ester with retention of the absolute stereochemistry at the 
stereogenic hydroxy-substituted carbon. Non-racemic chiral vicinal halo, hydroxy 

20 substituted carboxylic acid ester substrates may be converted to substantially equally non- 
racemic vicinal cyano, hydroxy substituted carboxylic acid ester products with little or no 
loss in stereopurity. 

Suitable vicinal halo, hydroxy substituted carboxylic acid esters employed in the 
practice of the present invention include those having the structure VII, which provide vicinal 
25 cyano, hydroxy substituted carboxylic acid ester having the corresponding structure VIIL 





VII 



VIII 



where: 
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X, R\ R 2 , R s , and R 6 are as defined for structure IA, and 
n is zero or from 1 to 9, inclusive, and 

each M n is independently selected from -C(=0)- (i.e., carbonyl) or -CR n, R n "- 5 wherein 
R n ' and R n " are each independently selected from the group consisting of hydrogen, fluorine, 
an optionally substituted lower alkyl, an optionally substituted cycloalkyl, an optionally 
substituted lower alkenyl, an optionally substituted aryl, an optionally substituted arylalkyl, 
amino, an optionally substituted lower alkylamino, an optionally substituted cycloalkylamino, 
an optionally substituted lower alkoxy, an optionally substituted cycloalkoxy, an optionally 
substituted aryioxy, an optionally substituted aryl, hydroxyl, nitro, amino, cyano,- carboxy 
(i.e. a carboxylate or carboxylic acid group), carboalkoxy (i.e. an ester group), carbamide 
(i.e.an amide group), and acyl (i.e. forming a ketone). 

Suitable vicinal halo, hydroxy substituted carboxylic acid esters have the structure VII 
with n=l, and include the 4-halo-3-hydroxybutyric acid esters previously described. 

Other suitable vicinal halo, hydroxy substituted carboxylic acid esters have the 
structure VII with n=0, thereby having structure IX, which provide vicinal cyano, hydroxy 
substituted carboxylic acid ester having the corresponding structure X. 

- OH . OH 





X 
where: 

X, R l ? R 2 , R 5 , and R 6 are as defined for structure VII. 
Suitable vicinal halo, hydroxy substituted carboxylic acid esters having the structure IX 
include 4-halo-2-hydroxypropionic acid esters'. With respect to structures VII and IX, X is 
typically chlorine or bromine, and preferably chlorine. With respect to structures VII through 
X, R l , R 2 , R 5 , and R° are preferably each independently either hydrogen or a lower alkyl. 

Other suitable vicinal halo, hydroxy substituted carboxylic acid esters have the 
structure VII with n from 2 to 9, inclusive. Typically, n is from 2 to 8, inclusive, and more 
typically, from 2 to 7, inclusive, and usually, from 2 to 6, inclusive. Suitable vicinal halo, 
hydroxy substituted carboxylic acid esters having n=2 include 5-halo-4-hydroxypentanoic 
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acid esters. Suitable vicinal halo, hydroxy substituted carboxylic acid esters having n=3 
include 6-halo-5-hydroxyhexanioc acid esters. Suitable vicinal halo, hydroxy substituted 
carboxylic acid esters having n=4 include 7-halo-6-hydroxyheptanoic acid esters. Suitable 
vicinal halo, hydroxy substituted carboxylic acid esters having n=5 include 8-halo-7- 
hydroxyoctanoic acid esters. Suitable vicinal halo, hydroxy substituted carboxylic acid esters 
having n=6 include 9-halo-S-hydroxynonanoic acid esters. Suitable vicinal halo, hydroxy 
substituted carboxylic acid esters having n=7 include 10-halo-9-hydroxydecanoic acid esters. 
Suitable vicinal halo, hydroxy substituted carboxylic acid esters having n=8 include 1 1-halo- 
1 O-hydroxyundecanoic acid esters. Suitable vicinal halo, hydroxy substituted carboxylic acid 
esters having n=9 include 12-halo-l 1-hydroxydodectanoic acid esters. 

Preferred vicinal halo, hydroxy substituted carboxylic acid esters are 6-halo-5- 
hydroxy-3-oxohexanoic acid esters having the structure XI and 6-halo-3,5-dihydroxy- 
hexanoic acid esters having the structure XIII, which provide vicinal cyano, hydroxy 
substituted carboxylic acid ester having the structures XII and XIV, respectively. 





R 1 " R 3 
XIII 

0 where for XI, XII, XIII, and XIV: 

>1 i>2 D 5 



XIV 



X, R , R", R 5 , and R° are as defined for structure VII. R 1 ', R 1 ", R 2, ( R 2, » ) R 3 ' ^ r3» 
are as .defined for R n ' and R n " in stnicture Vn. Preferably; R 1 , R 2 , R 6 , r'» j r'" > r2« r 3. ^ 
R 3 " are each independently either hydrogen or a lower.alkyl. X is typically chlorine or 
bromine. 

Preferred 6-halo-5-hydroxy-hexanoic acid esters of structure XI or XIII have X=C1. 
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Specific 6-halo-5-hydroxy-3-oxohexanoic acid esters of structure XI that may be 
employed in the practice of the present invention include t-butyl 6-chloro-5-hydroxy-3- 
oxohexarioic acid ester (i.e., where X is chlorine, R 1 , R 2 , R 6 , R 1 ', R 1 ", r 2 ", r* md R 3„ afe 
hydrogen and R 5 is t-butyl) and t-butyl 6-bromo-5-hydroxy-3-oxohexanoic acid ester (i.e., 
where X is bromine, R 1 , R 2 , R«, R 1 ', R l », R 2 », R 3 ', and R 3 » are hydrogen, and R 5 is t-butyl). 
When employed in the method of the present invention, these substrates yield the 
corresponding vicinal cyano, hydroxyl substituted product having structure XII, where R 1 , . 
R 2 , R 6 , R 1 ', R 1 ", R 2 ", R 3, } and R 3 " are hydrogen and R 5 is t-butyl. 

Specific 6-halo-3,5-dihydroxyhexanoic acid esters having the structure XIII that may 
be employed in the practice of the present invention include t-butyl 6-chloro-3,5-dihydroxy- 
hexanoic acid ester (i.e., where X is chlorine, R 1 , R 2 , R 6 , R 1 ', R 1 -', r 2 '-, r3. ( and R 3„ arg 
hydrogen, and R 5 is t-butyl) and t-butyl 6-bromo-3,5-dihydroxyhexanoic acid ester (i.e., 
where X is bromine; R 1 , R 2 , R 6 , R 1 ', R 1 ", R 2 », R 3 ', and R 3 » are hydrogen, and R 5 is t-butyl). 
When employed in the method of the present invention, these 6-halo-3,5-dihydroxyhexanoic 
acid ester substrates yield vicinal cyano, hydroxyl substituted product having structure XIV, 
where R 1 , R 2 , R 6 , R 1 ', R 1 ", R 2 ", R 3 ', and R 3 " are hydrogen, and R 5 is t-butyl 

Suitable halohydrin dehalogenases for catalyzing the conversion of vicinal halo, 
hydroxy substituted carboxlic acid ester substrate to vicinal cyano hydroxyl substituted 
carboxylic acid product include both naturally occurring and non-naturally occurring 
halohydrin dehalogenases, as previously described in part I supra. Halohydrin dehalogenases 
may be readily identified using the assay described in Example 4 (part (3)) and substituting 
for ethyl (S>4-chloro-3-hydroxybutyrate the vicinal halo, hydroxyl substituted carboxylic 
acid substrate of interest. Conditions that are suitable to form a reaction mixture for 
converting the vicinal halo, hydroxy substituted carboxylic acid ester to a vicinal cyano,, 
hydroxy substituted carboxylic acid ester are the same as described for converting 4-halo-3- 
hydroxybutyric acid esters to their corresponding 4-cyano-3-hydroxybutyric acid esters (see 
part I supra), e.g., cyanide sources, methods for controlling pH, temperatures, solvents and 
co-solvent systems, and the like. Exemplary methods for converting 6-halo-5-hydroxy-3- 
oxohexanoic acid esters (structure XI) and 6-halo-3,5-dihydroxyhexanoic acid esters 
(structure XIII) to their corresponding vicinal cyano, hydroxy substituted carboxylic acid 
esters (i.e., structures XII and XTV, respectively) are provided in Examples 32-41 
hereinbelow. 
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V. COMPOSITIONS 

The present invention further provides compositions that are useful for the enzymatic 
conversion of 4-halo-3-hydroxybutyric acid ester or amide to 4-nucleophile substituted-3- 
hydroxybutyric acid ester or amide. These compositions comprise a halohydrin 
dehalogenase, a 4-halo-3-hydroxybutyric acid ester or amide, and a nucleophile. In a 
preferred composition, the nucleophile is cyanide. 

In a further embodiment, the present invention provides compositions useful for 
preparing 4-nucleophile substituted-3-hydroxybutyric acid esters and amides that have a 
ketoreductase, a cofactor regeneration system, a cofactor, and a halohydrin dehalogenase. 
These compositions may further include a 4-halo-3-ketobutyric acid ester or amide. 

Any of the previously described ketoreductases, components of a cofactor 
regeneration system, cofactors, halohydrin dehalogenases, 4-halo-3-ketobutyric acid esters or 
amides, 4-halo-3-hydroxybutyric acid esters or amides, and nucleophiles may be employed in 
these compositions. 

The present invention also provides compositions that are useful for carrying out the 
conversion of a vicinal halo, hydroxy substituted carboxylic acid ester to a vicinal cyano, 
hydroxy substituted carboxylic acid ester or amide. These compositions comprise a 
halohydrin dehalogenase, a cyanide, and a vicinal halo, hydroxyl substituted carboxylic acid 
ester. 

Compositions of the present invention may be in solid (e.g., a powder) or liquid (e.g., 
solution, emulsion, suspension, and the like) form. For example, the composition maybe in 
the form of a lyophilized or spray dried powder. Alternatively, the composition may further 
comprise a solvent. 

The compositions may further include components for pH control or processability, 
including, for example, a salt, an acid, a base, a buffer, a solubilizing agent, etc. 

VI. HALOHYDRIN DEHALOGENASES. KETOREDUCTASES, AND COFACTOR 
REGENERATION SYSTEM ENZYMES AND CORRESPONDING 
POLYNUCLEOTIDES 

In addition to the specific enzymes and polynucleotides described herein, those 
having ordinary skill in the art will recognize that known techniques can be readily applied in 
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the discovery of both naturally occurring and non-naturally occurring polynucleotides 
encoding enzymes suitable for use in the practice of the present invention. See, e.g., Ling, 
et al, "Approaches to DNA mutagenesis: an overview," Anal. Biochpm 254(2): 1 57-78 
(1997); Dale,etal, "Oligonucleotide-directed random mutagenesis using the 
phosphorothioate method," Methods Mol. RinI 57:369-74 (1996); Smith, "In vitro 
mutagenesis," Ann. Rev. Genet. , 1 9:423-462 (1985); Botstein, et al., "Strategies and 
applications of in vitro mutagenesis," Science, 229:1 1.93-1 201 (1985); Carter, "Site-directed 
mutagenesis," Biochem. J. , 237:1-7 (1986); Kramer, et al., "Point Mismatch Repair," Cell. 
38:879-887 (1984); Wells, et al., "Cassette mutagenesis: an efficient method for generation of 
multiple mutations at defined sites," Gene, 34:315-323 (1985); Minshull, et al., "Protein 
evolution by molecular breeding." Current Opinion i n Chemical Rinlnay r^^p (]999) . 
Christians, et al., "Directed evolution of thymidine kinase for AZT phosphorylation using ' 
DNA family shuffling," Nature Biotechnology, 17:259-264 (1999); Crameri, et al., "DNA 
shuffling of a family of genes from diverse species accelerates directed evolution," Nature. 
391:288-291; Crameri, et al., "Molecular evolution of an arsenate detoxification pathway by 
DNA shuffling," Nature Biotechnology, 15:436-438 (1997); Zhang, et al., "Directed 
evolution of an effective mcosidase from a galactosidase by DNA shuffling and screening," 
Proceedings of the National Academy of Sciences. US A , 94:45.4.4599. Crameri, et al. 
"Improved green fluorescent protein by molecular evolution using DNA shuffling," Nature 
Biotechnology , 14:315-319 (1996); Stemmer, "Rapid evolution of a protein in vitro by DNA 
shuffling," Nature, 370:389-391 (1994); Stemmer, "DNA shuffling by random fragmentation 
and reassembly: In vitro recombination for molecular evolution," Proceedings of the 
National Academy of Sciences, U.S. A , 91:10747-10751 (1994); WO 95/22625; WO 
97/007S; WO 97/35966; WO 98/27230; WO 00/42651; and WO 01/75767 . These and other 
known methods can be readily applied, for example, together with the assays described 
herein, to identify other ketoreductases, hatohydrin dehalogenases, and cofactor regeneration 
system enzymes having the activities described herein, as well as other desirable properties, 
e.g., altered temperature and/or pH optimums, solvent resistance, and the like. For example, 
a ketoreductase may be mutated or evolved to generate libraries that can be screened to 
identify a ketoreductase having a preference for one cofactor type over another, for example, 
NAD versus NADP, or vice-versa. 



40 



WO 2005/018579 



PCT7US2004/004819 



Polynucleic acid sequences encoding the enzymes employed in the present invention 
may be codon optimized for optimal production from the host organism selected for 
expression. Those having ordinary skill in the art will recognize that tables and other 
references providing codon preference information for a wide range of organisms are readily 
available; See e.g., Henaut and Danchin, "Escherichia coli and Salmonella," Neidhardt, et 
al. eds., ASM Press, Washington D.C. (1996) pp. 2047-2066. 

Enzymes employed in the practice of the present invention may be produced by 
transforming a vector containing a polynucleotide encoding halohydrin dehalogenase, 
ketoreductase, or a cofactor regeneration system enzyme into a host cell using well known 
molecular biology techniques. See, e.g., Berger and Kimmel, "Guide to Molecular Cloning 
Techniques", Methods in Enzvmologv. Volume 152, Academic Press, Inc., San. Diego, CA; 
Sambrook, et al., "Molecular Cloning— A Laboratory Manual," 2 nd Ed., Vol. 1-3, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, New York, 1989; and "Current Protocols in 
Molecular Biology," P.M. Ausubel, et al., eds., Current Protocols , a joint venture between 
Greene Publishing Associates, Inc. and John Wiley & Sons, Inc. (supplemented through 
1 999). Methods for making the enzymes are illustrated in Examples 1 and 2. 

The foregoing and other aspects of the invention may be better understood in 
connection with the following non-limiting examples. 

EXAMPLES 

Example 1: Construction of Expres sion Constructs for Expression of Halohvdrin 
Dehalogenase. Ketoreductase. and Glucose Dehydrogenase, 
(1) Halohvdrin Dehalogenase fHHDH~> 

The gene for the halohydrin dehalogenase was codon optimized for expression in E. 
coli based on the amino acid sequence of the halohydrin dehalogenase from Agrobacterium 
sp. The gene was synthesized using 60-mer oligomers, and cloned into expression vector 
pCKl 10700 (depicted in Figure 2) under the control of a T5 promoter. The vectors were 
transformed into E. coli TOPI 0 (Invitrogene, Carlsbad, CA) from which plasmid DNA was 
prepared using standard methods. The plasmid DNA was then transformed into E. coli BL21 
(Stratagene, La Jolla, CA), the expression host, using standard methods. Several clones were 
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found in the expression library that expressed active HHDH. The genes from these clones 
were sequenced (see SEQ ID Nos: 13 (HHDH.l), 15 (HHDH.2), and 17 (HHDH. 16) which 
encode polypeptide sequences SEQ ID Nos. 14, 16, and IS, respectively). 

(2) Ketoreductase fKRED) 

5 . The gene for the ketoreductase was codon optimized for expression in E. coli based 

on the amino acid sequence of the ketoreductase from Candida magnoliae. The gene was 
synthesized using 60-mer oligomers, and cloned into the Sfil cloning sites of expression 
vector, pCKl 10900 (depicted in Figure 3), under the control of a lac promoter and lad 
repressor gene. The expression vector contains the pi 5 A origin of replication and the 
1 0 chloroamphenicol resistance gene. The plasmids were transformed into an E. coli 

expression host using standard methods. Several clones were found, that expressed active 
ketoreductase and their genes were sequenced to confirm the DNA sequences (see SEQ ID 
Nos: 1 (Ketoreductase 1), 3 (Ketoreductase 2), 5 (Ketoreductase 3), and 7 (Ketoreductase 4), 
which encode for polypeptide sequences SEQ ID Nos. 2, 4, 6, and 8, respectively). 

15 

(3) Glucose Dehydrogenase fGDHl 

The genes for the glucose dehydrogenase were amplified using the polymerase chain 
reaction (PCR) from genomic DNA preparations of Bacillus subtilis and Bacillus 
megaterium. The primers for the amplification reactions were designed using the published 
20 B. subtilis and B. megaterium glucose dehydrogenase gene sequences, and were as follows: 
B. subtilis forward primer (SEQ ID NO: 19) : 
5*-GAATTCGCCCATATGTATCCGGATTTAAAAGG-3^ 
B. subtilis reverse primer (SEQ ID NO: 20): 

5'-TGGCCGGATCCTCATTAACCGCGGCCTGCCTGGA-3' 
25 B, megaterium forward primer (SEQ ID NO: 21): 

5'-GAATTCGCCCATATGTATAAAGATTTAGAAGG-3' ' 

B. megaterium reverse primer (SEQ ID NO 22): 

5 t -GGCCGGATCCTCATTATCCGCGTCCTGCTTGGA-3' 

The PCR products were cloned into the Sfil cloning sites of expression vector, 
30 pCKl 10900 (depicted in Figure 3), under the control of a lac promoter and lad repressor 

gene. The expression vector contains the p 15 A origin of replication and the 

chloroamphenicol resistance gene. The plasmids were transformed into an E.coli expression 
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host using standard methods. Several clones were found to express active GDH and the 
genes were sequenced to confirm the sequences (see SEQ ID Nos: 9 (Glucose dehydrogenase 
S06-3) and 1 1 (Glucose dehydrogenase M02-6), which encode for polypeptide sequences 
SEQ ID Nos. 10 and 12, respectively). 

5 

(4) ( Formate Dehydrogenase (FDR) 

The genes for the formate dehydrogenase were codon optimized for expression in E. 
coli based on the amino acid sequences of the formate dehydrogenase from Pseudomonas 
species strain 101 (Protein Database Accession ID 2NAD_A) and Candida boidinii (Genbank 

10 Accession No. CAA09466). The genes were synthesized using 60-mer oligomers, and 
cloned into the S/zI cloning sites of expression vector, pCKl 10900 (depicted in Figure 3), 
under the control of a lac promoter and lad repressor gene. The expression vector contains 
the p 1 5 Aorigin of replication and the chloroamphenicol resistance gene. The plasmids were 
transformed into an E. coli expression host using standard methods. Clones were found that 

1 5 expressed active formate dehydrogenase and the genes were sequenced to confirm the DNA 
sequences (see SEQ ID NOS: 69 and 71, which encode for polypeptide sequences SEQ ID 
Nos. 70 and 72, respectively.) 

Example 2: Production of Enzyme 

20 (1) HHDH enzyme : 

In an aerated agitated fermentor, 10.0L of growth medium containing 0.528g/L 
ammonium sulphate; 7.5g/L of di-potassium hydrogen phosphate trihydrate; 3.7g/L of 
potassium dihydrogen phosphate; 2g/L of Tastone-154 yeast extract; 0.05g/L ferrous 
sulphate; and 3ml/L of a trace element solution containing 2g/L of calcium chloride 

15 dihydrate, 2.2g/L of zinc sulfate septahydrate, 0.5g/L manganese sulfate monohydrate, 1 g/L 
cuprous sulfate heptahydrate: 0.1 g/1 sodium borate decahydrate and 0.5 g/L EDTA, was 
brought to a temperature of 30 °C. The fermentor was inoculated with a late exponential 
culture of Escherchia coli BL21 (Stratagene, La Jolla, CA) equipped with plasmid containing 
HHDH polynucleotides as described in Example 1, then grown in a shake flask containing 

50 LB, 1% glucose (Sigma Chemical Co., St. Louis, MO), and 30jJ.g/ml chloroamphenicol 

(Sigma Chemical Co., St. Louis, MO) to a starting optical density at 600 nm (OD 6 oo) of 0.5 to 
2.0. The fermenter was agitated at 500-1500 rpm and air was supplied to the fermentation 
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vessel at 1.0-15.0 L/min to maintain a dissolved oxygen level of 30% saturation or greater. 
The pH of the culture was controlled at 7.0 by addition.of 20% v/v ammonium hydroxide. 
After the culture reached an OD 600 of 40, the temperature was reduced to 25°C and the 
expression of halohydrin dehalogenase was induced by the addition of isopropyl-p-D- 
thiogalactoside (IPTG) (Sigma Chemical Corp., St. Louis, MO) to a final concentration of 
lmM. The culture was grown for another 15 hours. After the induction, the cells were 
harvested by centrifugation and washed with 10 mM potassium phosphate buffer, pH 7.0. 
The cell paste was used directly in the downstream recovery process or was stored at -S0°C 
until use. 

(2) Ketoreductase enzyme : 

In an aerated agitated fermentor, 10.0L of growth medium containing 0.528g/L 
ammonium sulphate, 7.5g/L of di-potassium hydrogen phosphate trihydrate, 3.7g/L of 
potassium dihydrogen phosphate, 2g/L of Tastone-154 yeast extract, 0.05g/L ferrous 
sulphate, and 3ml/L of a face element solution containing 2g/L of calcium chloride 
dihydrate, 2.2g/L of zinc sulfate septahydrate, 0.5g/L manganese sulfate monohydrate, lg/L 
cuprous sulfate heptahydrate, 0.1 g/L sodium borate decahydrate and 0.5g/L EDTA, was 
brought to a temperature of 30 °C. 

The fermentor was inoculated with a late exponential culture of Escherichia coli 
W31 10 (pCR2-5) grown in a shake flask containing LB, 1% glucose (Sigma Chemical Co., 
St. Louis, MO), and 30 ug/ml chloramphenicol (Sigma Chemical Co., St. Louis, MO) to a 
starting optical density at 600 nm (OD 600 ) of 0.5 to 2.0. The fermentor was agitated at 500- 
1500ipm and air was supplied to the fermentation vessel at 1 .0-15.0 L/min, and the pH of the 
culture was controlled at 7.0 by addition of 20?/ o v/v ammonium hydroxide. After the culture 
reached an OD m of 40, the temperature was reduced to 25 °C and the expression of glucose 
dehydrogenase was induced by the addition of isopropyl-|3-D-thiogalactoside (IPTG) (Sigma 
Chemical Corp., St. Louis, MO) to a final concentration of 1 mM. The culture was grown for 
another 15 hours. After the induction, the cells were harvested by centrifugation and washed 
with 10 mM potassium phosphate buffer, pH 7.0. The cell paste was used directly in the 
downstream recovery process or was stored at -80 °C until use. 



44 



WO 2005/018579 



PCT/US2004/004819 



(3) Glucose dehydrogenase enzyme : 

In an aerated agitated fennentor, 10.0L of growth medium containing 0.528g/L 
ammonium sulphate; 7.5g/L of di-potassium hydrogen phosphate trihydrate; 3.7g/L of 
potassium dihydrogen phosphate; 2g/L of Tastone-154 yeast extract; 0.05g/L ferrous 
sulphate; and 3ml/L of a trace element solution containing 2g/L of calcium chloride 
dihydrate, 2.2g/L of zinc sulfate septahydrate, 0.5 g/L manganese sulfate monohydrate, Ig/L 
cuprous sulfate heptahydrate; 0.1 g/1 sodium borate decahydrate and 6.5 g/L EDTA, was 
brought to a temperature of 30 °C. 

The fennentor was inoculated with a late exponential culture of (pGDHS06 or 
pGDHM02) grown in a shake flask containing LB, 1% glucose (Sigma Chemical Co., St 
Louis, MO), and 30 (ig/ml chloroamphenicol (Sigma Chemical -Co., St. Louis, MO) to a 
starting optical density at 600 nm (OD m ) of 0.5 to 2.0. The fermenter was agitated at 500- 
1500rpm and air was supplied to the fermentation vessel at 1.0-15.0L/min, and the pH of the 
culture was controlled at 7.0 by addition of 20% v/v ammonium hydroxide. After the culture 
reached an OD 6 oo of 40, the temperature was reduced to 25 °C and the expression of glucose 
dehydrogenase was induced by the addition of isopropyl-j3-D-thiogalactoside (IPTG) (Sigma 
Chemical Corp., St. Louis, MO) to a final concentration of ImM. The culture was grown for 
another 15 hours. After the induction, the cells were harvested by centrifugation and washed 
with 1 0 mM potassium phosphate buffer, pH 7..0. The cell paste was used directly in the 
downstream recovery process or was stored at -80 °C until use! 

(4) Formate Dehydrogenase 

In an aerated agitated fermenter, 10.0L of autoclaved minimal medium containing 
3.5g/L of NaNH 4 HP0 4 - 4H 2 0, 7.5g/L of K 2 HP0 4 ' 3 H 2 0, and 3.7g/L of KH 2 P0 4 (see 
Lageveen, et al., 19SS, Appl. Environ. Microbiol . 54:2924. (1988)), 2g/LNH 4 Cl, 0.528g/L 
(NH 4 )2S0 4 , pH 7.0, 5 ml/L of R2 trace elements (see Reisenberg, et al. Appl. Microbiol. 
Biotechnol 1990 34:77), 20ml/L of 10% yeast extract solution in water, 5 ml/L 1 M MgS0 4 , 
40 ml/L of 50% glucose solution in water were added. The temperature of the medium was 
brought to 30 °C. 

Chloroamphenicol was added from a concentrated stock solution, to a final 
concentration of 30 |ig/ml. The fermenter was inoculated with an overnight culture of 
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Escherichia coli W3110 (pFDHPs3 or PFDHCbl3) grown in a shake flask containing the 
above minimal medium with R2 trace element solution, pH 7.0, 0.2% yeast extract, 1% 
glucose, and 30 ng/ml chloroamphenicol to a starting optical density at 600 nm (OD 6 oo) of 
0.04 - 0. 1 . The air was supplied to the fermentation vessel at 5.0 L/min. the pH of the culture 
5 was maintained at 7.0 using a concentrated solution of potassium hydroxide in water. The 
culture was grown to an OD 600 of 12-15, at which time a feed solution of 50% glucose, 6% 
ammonium chloride and 0.5% magnesium sulfate was initiated at a rate that resulted in a final 
dissolved oxygen concentration of 30-40% of air saturation. The feed pump rate was 
controlled such that the dissolved oxygen in the fermenter was maintained around 30% at 
1 0 airflow rate of 1 0 L/min and agitation rate of 600 rpm. After the culture reached an OD 60 o of 
1 5 and had been exposed to the feeding regimen for a few hours, the expression of the 
formate dehydrogenase was induced by the addition of 1 mM of IPTG. The culture was 
grown for another 8-18 hours before it was harvested by centrifugation. 

15 Example 3: Enzyme Preparation 

(1) Ketoreductase 

The cell paste was washed by suspending 1 volume wet weight of cell paste in 3 
volumes of lOOmM Tris/sulfate (pH 7.2) follwed by centrifugation at 5000g for 40 minutes in 
a Sorval 12BP. The washed cell paste was suspended in 2 volumes of lOOmM Tris/sulfate 

20 (pH 7.2). The intracellular KRED was released from the cells by passing the suspension 
through a homogenizer in two passes using a pressure of 14,000 psig for the first pass and 
8,000 psig for the second pass. The lysate was warmed to room temperature, then a 10% w/v 
solution of polyetliyleneimine (PEI), pH 7.2, was added to the lysate to a final PEI 
concentration of 0.75% w/v and stirred for 30 minutes. The treated homogenate was 

25 centrifuged at 1 0,000 rpm in a Beckman lab centrifuge for 60 minute's. The supernatant was 
decanted and dispensed in shallow containers, frozen at -20 °C and lyophilized. 

(2) Glucose Dehydrogenase 

The cell paste was washed by suspending 1 volume wet weight of cell paste in 3 
30 volumes of 1 OOmM Tris/sulfate (pH 7.2) followed by centrifugation at 5000g for 40 minutes 
in a Sorval 12BP. The washed cell paste was suspended in 2 volumes of 1 OOmM Tris/sulfate 
(pH 7.2). The intracellular HHDH was released from the cells bypassing the suspension 
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through a homogenizer in two passes using a pressure of 14,000 psig for the first pass and 
8,000 psig for the second pass. The homogenate was centrifuged at 10,000 rpm in a 
Beclanan lab centrifuge for 60 minutes. The supernatant was decanted and dispensed in 
shallow containers, frozen at -20 °C and lyophilized. 

5 

(3) Halohydrin Dehalogenase 

The cell paste was washed by suspending 1 volume wet weight of cell paste in 3 
volumes of lOOmM Tris/sulfate (pH 7.2) followed by centrifugation at 5000g for 40 minutes 
in a Sorval 12BP. The washed cell paste was suspended in 2 volumes of lOOmJVI Tris/sulfate 

1 0 (pH 7.2). The intracellular HHDH was released from the cells by passing the suspension 
through a homogenizer in two passes using a pressure of 14,000 psig for the first pass and 
8,000 psig for the second pass. The cell lysate was allowed to cool to 4 °C between passes 
through the homogenizer. The homogenate was centrifuged at 10,000 rpm in a Beckman lab 
centrifuge for 60 minutes. The supernatant was decanted and dispensed in shallow 

1 5 containers, frozen at -20 °C and lyophilized to a powder that was stored at -80 °C. 

To assess the quality of the preparation after fermentation, cell lysate containing the 
expressed halohydrin dehalogenase enzyme was assayed according to the following protocol. 
Approximately 50 ^1 of clarified cell lysate in lOOmM Tris-S0 4 , lOOmM NaCN, pH 8.0 was 
mixed with lOmM ethyl-(S)-4<;hloro-3-hydroxybutyrate (Sigma Aldrich, St. Louis, MO or 

20 prepared in accordance with the ketoreductase-catalyzed methods described herein). The 
total reaction volume was 0.2 ml. The reaction was incubated at room temperature for 30 
min to 1 hour. The reaction was extracted with 7 volumes of ethyl acetate and the organic 
layer removed to a 1.8 ml GC vial. The organic layer was analyzed by GC for presence of 
the ethyl-(R)-4-cyano-3-hydroxybutyrate product. The amount of product produced was 

25 determined by comparison to a standard curve prepared and analyzed under the same 
conditions. 

(4) Formate Dehydrogenase 

Cell lysate containing expressed formate dehydrogenase was prepared by 
30 homogenization of cell paste in 1 volume 100 mM triethanolamine ,(pH 7.0) at 4°C. The cell 
lysate was allowed to cool to 4°C between passes through the homogenizer. Cell lysate was 
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15 



clarified by centrifugation at 4°C. The clarified lysate was assayed as described in Exampk 
4. 



Example 4: Characterization of Enzvme Activity 
(1) . Ketoreducta.se fKRED') 

To a solution of ethyl 4-chloro-3-ketobutyric acid ester (10 raM) in 100 mM 
potassium phosphate buffer <pH 7.0) was added the ketoreductase enzyme as a predissolved 
solution in the same buffer. The reaction was initiated by addition of NADPH (1 mM final) 
and the course of reaction was followed by measurement of the decrease of absorbance at 340 
nm. This absorbance corresponds to the NADPH concentration. The results were plotted as 
Absorbance units (NADPH) vs. time, and the slope of the plot determined (Absorbance 
units/min). The slope of the Absorbance vs. time plot was converted to concentration units 
using the extinction coefficient of NADPH, and the activity of the ketoreductase was 
determined in units of umol (NADPH consumed)/min/mg (total ketoreductase catalyst). The 
measurement can also be performed using fluorescent detection utilizing an excitation of 340 
nm for NADPH with emission measured at 455 nm. Other substrates of interest may be 
substituted for ethyl 4-chloro-3-keto-butyric acid ester to evaluate ketoreductase activity with 
respect to other substrates. 

(2) Glucose Dehydrogenase CGDH) 

To a solution of 50 mM glucose in 1 00 mM potassium phosphate buffer (pH 7.0) was 
added the glucose dehydrogenase enzyme as a predissolved solution in the same buffer. The 
reaction was initiated by addition of NADP (1 mM final) and the course of reaction was 
followed by measurement of the increase of absorbance at 340 nm or of the fluorescence 
(excitation 340 nm, emission 455 nm). The results were plotted as Absorbance units 
(NADPH) vs. time, and the slope of the plot determined (Absorbance units/min). The slope 
of the Absorbance vs. time plot was converted to concentration units using the extinction 
coefficient of NADPH (see (1) above), and the activity of the glucose dehydrogenase was 
determined in units of umol (NADPH created)/min/mg (total glucose dehydrogenase 
catalyst). 

(3) Halohvdrin dehalogenase fHHDFH 
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To a solution of ethyl (S)-4-chloro-3-hydroxybutyrate (10 mM) in 300 mM potassium 
phosphate, 300 mM NaCN buffer (pH 8.0) was added the halohydrin dehalogenase enzyme 
as a predissolved solution in the same buffer. Over time, aliquots of the mixture were 
withdrawn and extracted with three volumes of ethyl acetate. The organic layer was then 
5 analyzed by gas chromatography (GC), as described hereinbelow in Example 6. Samples 
were taken at various time points, and the peak area of the product cyanohydrin, ethyl 
(R)-4-cyano-3-hydroxybutyrate, was plotted as a function of time. The peak areas were 
converted to concentration units using a standard curve that was prepared for the ethyl (RV4- 
cyano-3-hydroxybutyrate. Activity of the halohydrin dehalogenase was determined in units . 
10 of jiimol (cyanohydrin produced)/min/mg (total halohydrin dehalogenase catalyst). Other 
nucleophiles and/or substrates of interest may be substituted to evaluate halohydrin 
dehalogenase activity with respect to other nucleophiles and/or substrates. 

(4) Formate Dehydrogenase 

15 To a solution of 150 mM formate in 100 mM triethanolamine buffer (pH 7.0) was 

added the formate dehydrogenase enzyme as a predissolved solution in the same buffer. The 
reaction was initiated by addition of NAD (2 mM final) and the course of reaction was 
followed by measurement of the increase of absorbance at 340 nm or of the fluorescence 
(excitation 340 nm, emission 455 nm). The results were plotted as Absorbance units 

20 (NADH) vs. time, and the slope of the plot determined (Absorbance units/min). The slope of 
the Absorbance vs. time plot was converted to concentration units using the extinction 
coefficient of NADH, and the activity of the formate dehydrogenase was determined in units 
of jumol (NADH created)/min/mg (total formate dehydrogenase catalyst). 

25 Example 5: Preparation of ethyl (RV4-cyano-3-hvdroxybutyrate from ethyl 
4-chloroacetoacetate (via ethyl (S>4-cMoro-3-hvdroxvbutvrateY ' 

To a well-stirred solution of 1 00 mM potassium phosphate buffer, 500 mM NaCl, 
pH 7 (1 L) at room temperature was added glucose (160g, 830 mmoles, 1.1 equiv). To this 
was added ketoreductase SEQ ID NO: 2 (0.9g), glucose dehydrogenase S06 SEQ ID NO: 10 

30 (0.5 g) and NADP (0.5g) as lyophilized powders. Once dissolved, butyl acetate (500 mL) . 
was added to form an emulsion. To this emulsion was added a solution of ethyl 
4-chloroacetoacetate (lOOg, 608 mmoles) in butyl acetate (500 mL), dropwise over 3 hours. 
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The pH was maintained between 6.8 and 7 by an automatic titrater that dispensed Na 2 C0 3 
(2M in water, about 1 60 niL total). After 40 hours the automated addition of the base had 
ceased and there was no residual starting material by gas chromatography. The layers were 
separated, and the aqueous phase was washed with ethyl acetate (500 mL). The combined 
organics were dried over anhydrous sodium sulfate, filtered and evaporated on a rotary 
evaporator, to give essentially pure (-97%) ethyl (S)-4-chloro-3-hydroxybutyrate. 

To a well stirred solution of ethyl (S)-4-cWoro3-hydroxybutyrate (8.25 g, 50 mmoles) 
in 300 mM potassium phosphate buffer, 300 mM NaCN pH 8.0 (1L) at 30 °C was added 
halohydrin dehalogenase SEQ ID NO: 14 (9 g) as a Iyophilized powder. After fifty seven 
hours the mixture was washed with ethyl acetate (2 times 250 mL) and the combined 
organics dried over anhydrous sodium sulfate. The mixture was filtered and evaporated on a 
rotary evaporator to give essentially pure ethyl (R)-4-cyano-3-hydroxybutyrate, as 
determined using the gas chromatography method and elution time data described in Example 
6, hereinbelow. 

This example shows the process of the invention wherein a 4-cyano-3-hydroxybutyric 
acid ester (ethyl (R)-4-cyano-3-hydroxybutyrate) is produced by contacting a 4-halo- 

3- hydroxybutyric acid ester (ethyl (S)-4-chloro-3-hydroxybutyrate) with a halohydrin 
dehalogenase and cyanide (provided by a cyanide salt, NaCN). It further shows the process 
of the invention wherein the 4-halo-3-hydroxybutyric acid ester is provided by contacting a 

4- halo-3-ketobutyric acid ester (ethyl 4-chloroacetoacetate) with a ketoreductase, a cofactor 
(NADPH, provided as NADP), and a cofactor regeneration system (glucose and glucose 
dehydrogenase). It further shows the overall production of nonracemic chiral ethyl (R)-4- 
cyano-3-hydroxybutyrate from achiral ethyl 4-chloroacetoacetate in high e.e. and in high 
purity without extensive purification procedures. 

Example 6: Characteriza tion of Ethvl fRV4-cvano-1-hvdroxvbutY rate 

The ethyl 4-cyano-3-(R)-hydroxybutyrate produced in Example 5 was analyzed using 
gas chromatography with flame ionization (FID) detection using an Agilent HP-5 column, 30 
m long, 0.25 (Jm inner diameter, using the following program: 1 minute at 1 00°C, 
5°C/minute for 10 minutes; 25'C/minute for 2 minutes; then 2 minutes at 200°C. Inlet and 
outlet temperatures were both 300°C, and the flow rate was 2 ml/minute. Under these 
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conditions, ethyl (R)-4-cyano-3-hydroxybutyrate elutes at 6.25 minutes, ethyl (S)-4-chloro-3- 
hydroxybutyrate elutes at 4.5 minutes, and ethyl 4-chloroacetoacetate elutes at 4.1 minutes. 

Chemical purity of the species was measured using the integrated peak areas from the 
gas chromatography results. 
5 Enantioselectivity of the halohydrin dehalogenase (HHDH) with respect to ethyl 

(R)-4-cyano-3-hydroxybutyrate was measured by gas chromatography and FID detection 
using a Restek gammaDex SA column (30 m long, 0.32 pm inner diameter) using the 
following program: 25 minutes at 165°C and flow rate at 2 ml/min. Inlet and outlet 
temperatures were both at 230°C. Under these conditions ethyl (R)-4-cyano-3- 
10 hydroxybutyrate elutes at 19.6 minutes and ethyl (S)-4-cyano-3-hydroxybutyrate elutes at 
19.2 minutes. 

Example 7: Preparation of Ethyl f SV4-chloro-3-hvdroxvbutyrate from Ethyl 4-chloro- 
acetoacetate. 

15 To a 3-necked jacketed 3L flask equipped with a mechanical stirrer and connected to 

an automatic titrater by a pH electrode and a feeding tube for addition of base, was charged 
triethanolamine (6.6 mL) and H 2 0 (492 mL) to make 100 mM triethanolamine solution. The 
pH was adjusted to 7 with 37% HC1. Then, D-Glucose (125 g) was added. The water • 
circulating to the flask jacket was set to 30 °C. After 1 0 minutes, ketoreductase SEQ ID NO: 

20 2 (5.7g) and glucose dehydrogenase S06 SEQ ID NO: 10 (3.1 g) powder were added. After 
10 minutes, (3-NAD (125 mg) was added and the resulting mixture was allowed to stir for 5 
minutes. Then, butyl acetate (250 mL) was charged. Using an addition funnel, 2.4 M ethyl 
4-chloroacetoacetate (250 mL, 100 g in 167 mL of butyl acetate) was slowly added over 3 
hrs. The pH was maintained at 7 by the automatic titrater by the addition of 2 M Na 2 C03 (152 

25 mL) over 15 hrs. Subsequently, gas chromatography of a reaction sample showed complete 
conversion to product. Celite (16 g) was added and the reaction mixture was allowed to stir 
for 1 0 minutes. The solution was filtered through a celite pad and the organic layer was 
separated. The aqueous layer was extracted with butyl acetate (2x 200 mL). The organic 
layers were combined and the solvent removed under vacuum by rotary evaporation to obtain 

30 87 g of the ethyl (S) 4-chloro-3 -hydroxybutyrate. The enantiomeric excess was >99%, as 
determined after its conversion to ethyl (R)-4-cyano-3-hydroxybutyrate in Example 8. 
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Example 8: Preparation of Ethyl (TlM-cyano-3-hydroxvbutvrate from Ethvl fSV4-chloro-3- 
hvdroxvbutvrate 

To a 3-necked jacketed 3L flask equipped with a mechanical stirrer and connected to 
an automatic titrater by a pH electrode and a feeding tube for addition of base, was charged 
5 H 2 O.(1200 mL) 5 NaCN (37.25 g) and NaH 2 P0 4 (125 g) to bring the solution to pH 7. The 
water circulator was set to 40 °C. After 10 minutes, halohydrin dehalogenase SEQ ID NO: 
32 as cell lysate (250 mL) was added. The reaction mixture was allowed to stir for 5 minutes. 
Using an addition funnel, ethyl (S)-4-chloro-3-hydroxybutyrate (45 g of the material from 
Example 7) was slowly added over 1 hour. The pH was maintained at 7 by the automatic 

10 titrater by the addition of 10 M NaOH (27 mL) over 17 hrs. Subsequently, gas 

chromatography of a reaction sample showed complete conversion to product. Celite (16 g) 
was added to the flask, which was then connected to a diaphragm, whose exhaust is bubbled 
into 5M NaOH (200 mL), to remove HCN. The mixture was heated to 60 °C under 100mm' 
Hg pressure. After 1 hour a submerged air bubbler was added to the solution to aid the 

1 5 removal of the HCN. After 3 hours, an HCN detector indicated less than 5 ppm HCN in the 
off-gas. The mixture was allowed to cool to room temperature, then filtered through a celite 
pad. The filtrate was extracted with butyl acetate (3x S00 mL) and the combined organic 
layers filtered through a pad of activated charcoal. The solvent was removed under vacuum 
by rotary evaporation to provide 2S.5 g of ethyl (R)-4-cyano-3-hydroxybutyrate. The purity 

20 was 98% (w/w) by HPLC and the enantiomeric excess was >99% (by chiral GC, the S 
enantiomer was undetectable). 

Example 9: Preparation of Ethvl ( Sy4-cMoro~3-hydroxvbutvrate from Ethvl 4-chloro- 
acetoacetate. 

25 To a 100 mL vessel connected to an automatic titrater by a pH electrode and a feeding 

tube for addition of base was charged a solution of glucose (7.5 g) in 100 mM 
triethanolamine pH 7 buffer (25 mL). To this solution was charged ketoreductase SEQ ID 
NO: 42 (100 mg); 50 mg GDH SEQ ID NO: 66 and NADP (6.25 mg). Butyl acetate (10 ml) 
was then charged. Then, ethyl 4-chloroacetoacetate (6 g) in butyl acetate (10 mL) was 

30 charged. The pH was maintained at 7 by the automatic titrater by the addition of 4M NaOH 
(7.5 mL) over 7 hrs. A sample of the reaction mixture was extracted with an equal volume 
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of butyl acetate and the organic layer was analyzed by GC. The analysis showed 99% 
conversion of the ethyl 4-chloroacetoacetate to ethyl (S)-4-chloro-3-hydroxybutyrate. 

Example 1 0: Preparation of Ethvl ( Sy4-chloro-3-hvdroxvbutvrate from Ethvl 4-chloro- 
5 acetoacetate. 

The procedure was identical to Example 9 with the exceptions that 400 mg of the 
ketoreductase SEQ ID NO: 42 was used and NAD+ (12.5 mg) was added in place of the 
NADP. The addition of the NaOH solution by the automatic titrater was complete in 1 1 
hours and the GC analysis showed 99% conversion of the ethyl 4-chloroacetoacetate to ethyl 
1 0 (S)-4-chloro-3-hydroxybutyrate. 

Example 1 1 : Preparation of Ethvl ( SV4-chloro-3-hvdroxvbutvrate from Ethvl 4-chloro- 
acetoacetate. 

To a 100 mL vessel connected to an automatic titrater by a pH electrode and a feeding 
15 tube for addition of base was charged a solution of glucose (12. g) in water (30 mL). To tiiis 
solution was charged ketoreductase SEQ ID NO: 42 (100 mg); 50 mg GDH SEQ ID NO: 66 
and NADP (6.25 mg). Butyl acetate (10 ml) was then charged. Ethyl 4-chloroacetoacetate 
(1 0 g) was then charged via syringe pump as follows: 1 mL was charged rapidly and the 
remainder was then charged at a rate of 1 mL/hr). The pH was maintained at 7 by the 
20 automatic titrater by the addition of 4M NaOH over 1 8 Iiouts hrs. The stirring was stopped 
and the phases allowed to separate. The organic layer included some emulsion. The organic 
layer, including some emulsion, was separated and washed with 10 mL of water. The 
combined aqueous layers were extracted twice with 20 mL of butyl acetate. The organic 
extracts were combined and rotary evaporated under vacuum to remove water. Additional 
25 butyl acetate was added during the evaporation to help remove the water. When the water 
was removed the butyl acetate solution was decanted from solids in the flask. Evaporation of 
the solvent under vacuum then gave S.S5 g of ethyl (S)-4-chloro-3-hydroxybutyrate (87.4% 
yield) of very good purity. 

Example 12: Preparation of Ethvl rRV4-cvano-3-hvdroxvbutvrate from Ethvl fSV4-chloro- 
30 3-hvdroxvbiitvrate 

To a 170mL vessel connected to an automatic titrater by a pH electrode and a feeding 
tube for addition of base was charged NaCN (1 .5 g, 3 1 mmol) and water (50 mL). The vessel 



53 



WO 2005/018579 PCT/US2004/004819 



was sealed and the headspace was deaerated with nitrogen. The pH was adjusted to 7 by the 
addition of cone. H?S0 4 (0.9 mL). The reaction mixture was heated to 40 °C and treated with 
a solution of halohydrin dehalogenase SEQ ID NO: 32 (1.2 g in 10 mL water containing 42 
\iL of 14M ji-mercaptoethanol). Then, ethyl (S)-4-cbloro-3-hydroxybutyrate (1.8 g, 10.8 

5 mmol) was added via syringe. The automatic titrater maintained the pH at 7 by the addition 
of 2M NaOH. After 1 5 hr the reaction was complete and a total of 4.6 mL 2M NaOH had 
been added. A sample of the reaction mixture was extracted with an equal volume of butyl 
acetate. GC analysis of the organic extract showed the conversion of the ethyl (S)-4-chloro- 
3-hydroxybutyrate to ethyl (R)-4-cyano-3-hydroxybutyrate was >99%. 

0 ' 

Example 13: Preparation of Ethyl (Ry4-cvano-3-hvdroxybutvrate from Ethvl rSV4-chIom- 
3 -h vdrox vbutvrate 

The procedure was identical to Example 12 with the exception that 4M NaCN was 
used as the base instead of the 2M NaOH. After 8 hrs, the reaction was complete and a total 
.5 of 2.3 mL 4M NaCN had been added. By GC analysis, the conversion of the ethyl (S)-4- 
chloro-3-hydroxybutyrate to ethyl (R>4-cyano-3-hydroxybutyrate was >99%. 

This example shows the process of the invention using an alkali cyanide as base to 
maintain both the pH and the cyanide concentration of the reaction mixture constant. 

!0 Example 14 : Preparation of Ethvl (RV4-cvano-3-hvdroxvbutYrate from Ethvl (SV4-chloro-3- 
hvdroxvbutvrate 

. To a 250 mL vessel connected to an automatic titrater by a pH electrode and a feeding 
tube for addition of base (7.5 M NaOH) was charged water (83,5 mL) and 0.7 g of halohydrin 
dehalogenase SEQ ID NO: 24, The mixture was stirred for 30 minutes. The titrater was 

IS activated and set to maintain pH 7. Then, 25% aqueous HCN (9.26 ml, 8.6 g) was charged 
over 20 minute to make a 2.3% HCN solution. The mixture was heated at 40 °C for 10 
minutes, then ethyl (S)-4-chloro-3-hydroxybutyrate (5 g) was charged over 1 hour. The 
automatic titrater maintained the pH at 7 by the addition of 2M NaOH. After 20 hrs, GC 
analysis of a butyl acetate extract of a reaction sample showed the conversion of the ethyl 

50 (S)-4-chloro-3-hydroxybutyrate to ethyl (R)-4-cyano-3-hydroxybutyrate was 95%. 

This example shows the process of the invention using aqueous hydrocyanic acid as 
the source of cyanide. 
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Example 15: Preparation of Ethyl fRy4-cvano-3-hydroxvbutyrate from Ethyl fSV4-chloro-3- 
hvdroxvbutvrate 

To a 20 mL screw-cap vial was added NaCN (250 mg) and NaH 2 P0 4 (S30 mg). 
5 Water (1 0 mL) was added followed by halohydrin dehalogenase SEQ ID NO: 32 as 
lyophilized powder (200 mg). Then ethyl (S)-4-chloro-3-hydroxybutyrate (300 mg) was 
added. The vial was capped and heated in an oil bath at 40 °C. After 4 hours, GC analysis of 
a butyl acetate extract of a reaction sample extract showed of 54% conversion of the ethyl 
(S)-4-chloro-3-hydroxybutyrate to ethyl (R)-4-cyano-3-hydroxybutyrate. After 72 hrs, the 
10 GC analysis showed complete conversion. 

Example 16: Preparation of Ethyl fS>4-cyano-3-hvdroxybutvrate from Ethyl fRV4-chloro-3- 
hvdroxvbutvrate 

The procedure was identical to that of Example 15 with the exceptions that the 
1 5 (R)-enantiomer of the Ethyl 4-chloro-3-hydroxybutyrate was reacted instead of the 

(S)-enantiomer and the quantities of all reaction components were halved. After 1 hour 
reaction time, the GC analysis showed 55% conversion of the ethyl (R)-4-chloro- 

3- hydroxybutyrate to ethyl (S)-4-cyano-3-hydroxybutyrate. 

This example in combination with preceding examples shows that the process of the 
20 invention may be used to convert either enantiomer of the 4-halo-3-hydroxybutyric acid ester 
to the corresponding enantiomer of the 4-cyano-3-hydroxybutyric acid ester. 

Example 17: Preparation of Methyl fSV4-chloro-3-hvdroxvbutyrate from Methyl 4-chloro- 
acetoacetate 

25 The procedure was identical to that of Example 9 with the exceptions that an 

equimolar amount of methyl 4-chloroacetoacetate was reacted instead of the ethyl 

4- chloroacetoacetate and the enzymes used were ketoreductase SEQ ID NO: 50 and glucose 
dehydrogenase SEQ ID NO: 62. The reaction was completed in 1 1 hrs and the GC analysis 
showed >99% methyl (S)-4-cMoro-3-hydroxybutyrate. The product was isolated by 

30 extraction into butyl acetate and solvent evaporation and its identity confirmed by *H and 13 C 
NMR. 
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Example IS: Preparatio n of Methyl fRV4-cvano-3-hvdroxybutyrate from Methyl (SV4- 
chloro-3-hvdroxvbutvrate 

The procedure was identical to that of Example 1 6 with the exception that an 
equimolar amount of methyl (S)-4-chloro-3-hydroxybutyrate (prepared by Example 17) was 
5 reacted instead of ethyl (R)-4-cMoro-3-hydroxybutyrate. After 1 hour reaction time, the GC 
analysis showed 38% conversion of the methyl (R)-4-chloro-3-hydroxybutyrate to methyl 
(S)-4-cyano-3-hydroxybutyrate. The product was characterized by l E and 13 C NMR. 

Example 19: Preparation of Eth yl fRV4-cvano-3-hvdroxvbutvrate from Ethvl fSV4-bromn- 
i0 3-hvdroxvbutvrate. 

The procedure was identical to that of Example 16 with the exception that an 
equimolar amount of ethyl (S)-4-bromo-3-hydroxybutyrate was reacted instead of ethyl (R)- 
4-chloro-3-hydroxybutyrate. After 1 hour reaction time, the GC analysis showed 90% 
conversion of the ethyl (S)-4-bromo-3-hydroxybutyrate to ethyl (S)-4-cyano-3- 
1 5 hydroxybutyrate. The product was characterized by *H and I3 C NMR. 

This example shows that the process of the invention wherein the halo substituent of 
the 4-halo-3-hydroxybutyric acid ester is bromine. 

Example 20: Preparation of Ethyl 3-hvdroxvbutvrate from Ethyl acetoacetate. 

20 The procedure was identical to that of Example 17 with the exceptions that an 

equimolar amount of ethyl acetoacetate was reacted instead of the methyl 
4-chloroacetoacetate and 200 mg of ketoreductase SEQ ID NO: 50 and 100 mg of glucose 
dehydrogenase SEQ ID NO: 62 were used. The reaction was completed in 6 hrs. The 
product was isolated by extraction into butyl acetate and solvent evaporation and 

25 characterized by ] H and 13 C NMR. 

In combination with preceding examples, this example demonstrates that 
ketoreductase enzymes that have activity for the reduction of ethyl acetoacetate to ethyl 
3-hydroxybutyrate are useful for the reduction 4-halo-3-ketobutyric acid esters to 4-halo-3- 
hydroxybutyric acid esters in embodiments of this invention. 
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Example 21 : pH profiles of enzymatic and nonenzymatic test reactions of ethvl 4-chloro 
3-hvdroxvbutvrate with cyanide 

Aqueous solutions containing 25 mg/mL sodium cyanide were prepared at pH 5.0, 
6.0, 7.0, 7.5, 8.0, 8.5, and 9.0 by the addition of 85% phosphoric acid while monitoring with 
pH meter. 5 mL of each solution was charged to a separate 20 mL screw cap vial. 
Halohydrin dehalogenase SEQ ID NO: 38 (20 mg) was added to each vial, followed by ethyl 
(S)-4-chloro-3-hydroxybutyrate (50 mg, 0.30 mmoles). For nonenzymatic reactions 
experiments, the procedure was identical with the exception that the enzyme was omitted. 
The vials were capped and heated in an oil bath at 55 °C for 3 hrs, then removed and cooled 
to room temperature. A 0.4 mL sample of each reaction mixture was extracted with 1 mL 
butyl acetate and the extracts were analyzed by gas chromatography. 

The analyzed amounts of substrate and products in each vial are given in Table I, and 
graphed vs. pH in Figure 1 . In both, chlorohydrin means ethyl (S)-4-chloro-3- . . 

hydroxybutyrate, cyanohydrin means ethyl (R)-4-cyano-3-hydroxybutyrate, and crotonate 
means ethyl 4-hydroxycrotonate. In the Table, ND means not detected. 



Table I: Millimoles chlorohydrin, cyanohydrin and crotonate by-product analyzed in 
test reactions with and without halohydrin dehalogenase. See Example 21 





without halohydrin dehalogenase 


with halohydrin dehalogenase 


PH 


mmol 
chlorohydrin 


mmol 
cyanohydrin 


mmol 
crotonate 


mmol 
chlorohydrin 


mmol 
cyanohydrin 


mmol 
crotonate 


5.0 


0.33 


ND 


ND 


0.27 


ND 


ND 


6.0 


0.29 


ND 


ND 


0.07 


0.20 


ND 


7.0 


0.30 


ND 


ND 


0.01 


0.28 


ND 


7.5 


0.31 


ND 


ND 


0.004 


0.30 


ND 


8.0 


. 0.30 


0.01 


ND 


0.002 


0.29 


ND 


8.5. 


0.21 


0.05 


0.001 


0.001 


0.24 


ND 


9.0 


0.11 


0.10 


0.002 


0.001 


0.21 


ND 
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The pHs of the final test reaction mixtures were remeasured. For the mixtures 
including halohydrin dehalogenase with initial pHs of 7 or above (being the mixtures in 
which near complete conversion of the chlorohydrin to the cyanohydrin occurred, the final 
mixture pHs were 0.4 to 0.6 pH units below the initial pHs. The other mixtures showed much 
5 lesser changes in pH from then initial values. 

These data show that under these reaction conditions and time, no measurable 
nonenzymatic reaction of the ethyl 4-chloro-3-hydroxybutyrate with cyanide occurred at any 
tested pH less than 8. At pH 8 and above, increasing nonenzymatic reaction with cyanide to 
form ethyl 4-cyano-3-hydroxybutyrate occurred with increasing pH and was accompanied by 

1 0 increasing formation of ethyl 4-hydroxycrotonate by-product. In contrast, the enzymatic 
reaction with halohydrin dehalogenase occurred at all the tested pH's greater than 5 and with 
no detectable formation of ethyl 4-hydroxycrotonate at any tested pH. Additionally, for both 
enzymatic and nonenzymatic test reactions at pH greater than 8, the mole total of the GC- 
analyzed products decreased from the initial 0.30 mmoles provided (as ethyl 4-chloro-3- 

1 5 hydroxybutyrate reactant) indicating the increasing formation of non-analyzable by-products 
with increasing pH greater than 8. It was separately established that the ester group of the 
reactant and product are increasingly hydrolyzed to carboxylic acid groups at pHs greater 
than 8 and that the resulting carboxylic acids are not extracted in to the extracts of reaction 
mixture samples that are analyzed by GC. See Example 22. 

20 

Example 22: N onenzymatic hydrolysis of ethvl 4-cvano-3-hvdroxvbutvrate. 

Aqueous phosphate solutions were prepared at pH 7.0, 7.5, 8.0, 8.5, and 9.0 by 
dissolving 0.48 g of NaH 2 P0 4 in 40 mL water and adjusting the pH by addition of 2M NaOH 
while monitoring with pH meter. 5 mL of each solution was charged to a separate 20 mL 

25 screw cap vial. Then, ethyl (R)-4-cyano-3-hydroxybutyrate (46 mg, 0.29 mmol) was added. 
The vials were capped and heated in an oil bath at 55 °C for 3 hrs, then cooled to room 
temperature. A 0.4 mL of each reaction mixture was extracted with 1 mL butyl acetate and 
the extracts were analyzed by GC. For an external standard a duplicate of the pH 7.0 mixture 
was freshly prepared and immediately extracted. The analyzed amounts of ethyl 4-cyano-3- . 

30 hydroxybutyrate in each vial are given in Table H No product of its hydrolysis was detected 
in the reaction sample extracts. It was separately established that the carboxylic acid product 
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of hydrolysis of this ester is not extracted into the extracts of the reaction samples that are 
analyzed by GC. 



Table II: Millimoles chlorohydrin and 
cyanohydrin analyzed in test 
hydrolysis reactions. See Example 22 



pH 


mmol cyanohydrin- 


7.0 


0.29 


7.5 


0.2S 


S.O 


0.27 


8.5 


0.26 


9.0 


0.24 



5 The pHs of the final test mixtures were remeasured. The mixtures with initial pHs of 

S.O, 8.5, and 9.0 each had a final pH of 7.4. The mixture with an initial pH of 7.5 had a final 
pH of 7.3, and the mixture with an initial pH of 7 was unchanged. This evidences the 
production of carboxylic acid in the higher pH samples causing neutralization of the solutions 
into the phosphate buffering range. 
10 This example in combination with Example 21 shows that ethyl 4-cyano-3-hydroxy- 

butyrate is increasingly hydrolyzed with increasing pH at the pHs greater than 8 where it can 
be produced by nonenzymatic reaction of ethyl 4-chloro-3-hydroxybutyrate with cyanide. 

Example 23: Preparation of ethyl rRV4-cvano-3-hvdroxvbutvrate from ethyl 

15 4-chloroacetoacetate (via ethyl ('SV4-chIoro-3-hydroxvbutvrate). 

To a 100 mL vessel connected to an automatic titrater by a pH electrode and a feeding 
tube for addition of base (4M NaOH) was charged a solution (25 mL) of glucose (7.5 g) in 
100 mM triethanolamine buffer, pH 7. To this solution was charged ketoreductase SEQ ID 
NO: 50 (50 mg), glucose dehydrogenase SEQ ID NO: 62 (20 mg) and NADP (1.5 mg). Butyl 

20 acetate (10 ml) and ethyl 4-chloroacetoacetate (6 g) in additional butyl acetate (10 mL) were 
then charged. The pH was maintained at 7 by the automatic titrater by the addition of 4M 
NaOH to the stirring mixture over 13 hrs. The phases were then allowed to separate for 30 
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minutes and the organic layer (25 mL), containing the ethyl (S)-4-chloro-3-hydroxybutyrate 
intermediate, was removed. 

To a 170 mL vessel connected to an automatic titrater by apH electrode and a feeding 
tube for addition of base (2M NaOH) was charged sodium cyanide (1 .5 g) followed by water 
(50 mL). The vessel was sealed and the headspace was deaerated with nitrogen. The pH was 
adjusted to 7 using concentrated sulfuric acid (0.9 mL). The mixture was heated to 40 °C and 
treated with a solution of halohydrin dehalogenase SEQ ID NO: 32 (1.2 g) in 10 mL water 
containing 42 uL of 14M p-mercaptoethanol). Then, the organic layer (25 mL) containing 
ethyl (S)-4-chloro-3-hydroxybutyrate from the first step was added via syringe. The pH was 
maintained at 7 by the automatic titrater by the addition of 2M NaOH to the stirring mixture. 
After 15 hr, the conversion of ethyl (S)-4-chloro-3-hydroxybutyrate to ethyl (R)-4-cyano- 

3- hydroxybutyrate was 33% as indicated by the cumulative addition of 5 mL of the base (15 
mL expected for complete conversion). 

Example 24: Preparation of ethvl fRl4 -cvano-3-hvdro xvbutvrate from P th y l 

4- chloroacetoacetate fvi a ethvl fS^-chloro^-hvdroxvbutvrat^ 

To a 20 mL screw cap vial was added NaCN (125 mg, 2.55 mmol), NaH 2 P0 4 (415 
mg, 3.46 mmol) and glucose (750 mg, 3.S mmol). Water (5 mL) was added followed by 
NADP (2 mg), ketoreductase SEQ ID NO: 56 (50 mg), glucose dehydrogenase SEQ ID NO: 
62 (50 mg), and halohydrin dehalogenase SEQ ID NO: 32 (100 mg). Then ethyl 4-chloro- 
. acetoacete (24 mg, 0. 1 5 mmol) in 0.5 mL butyl acetate was added. The vial was capped and 
heated in an oil bath at 30 °C After 1 hr, GC analysis of a butyl acetate extract of a reaction 
sample showed 100% conversion of the ethyl 4-chloro-acetoacete to ethyl (S)-4-chloro- 

3- hydroxybutyrate, at 96% selectivity, and ethyl (R)-4-cyano-3-hydroxybutyrate at 4% 
selectivity. Then, the reaction vial was heated to 40 °C for 15 hrs. GC analysis of a butyl 
acetate extract then showed 2% of the ethyl (S)-4-chloro-3-hydroxy-butyrate remaining, with 
overall 98% yield of ethyl (R)-4-cyano-3-hydroxybutyrate based on the starting ethyl 

4- chloroacetate. . 

This example shows the process of the invention wherein a 4-cyano-3-hydroxybutyric 
acid ester (ethyl 4-cyano-3-(R)-hydroxybutyrate) is produced, via an intermediate 4-halo-3- 
' hydroxybutyric acid ester (ethyl 4-chloro-3-(S)-hydroxybutyrate) ) by contacting a 4-halo-3- 
ketobutyric acid ester (ethyl 4-chloroacetoacetate) with a ketoreductase, a cofactor (NADPH, 
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provided as NADP) a cofactor regeneration system (glucose and glucose dehydrogenase), a 
halohydrin dehalogenase, and cyanide (provided by an cyanide salt, NaCN) with all the 
reactants simultaneously present in the reaction mixture. 



Examples 25-29: Preparations of Ethyl (RM-cvano-3-hvdroxvbutyrate from Ethvl 
(Sy4-chloro-3-hvdroxybutvrate 

For each of Examples 25-29, to a 170mL vessel connected to an automatic titrater by 
a pH electrode and a feeding tube for addition of base was charged NaCN (1.5 g, 31 mmol) 
and water (50 mL). The vessel was sealed and the pH was adjusted to 7 by the addition of 
cone. H 2 S0 4 (0.9 mL). The reaction mixture was heated to 40 °C and treated with a solution 
of halohydrin dehalogenase (0.4 g in 1 0 mL water). The halohydrin dehalogenases used for 
these Examples had the polypeptide sequences given for the following SEQ ID NOs.: 

Example 25 SEQ ID No. 32 

Example 26 SEQ ID No. 94 

Example 27 SEQ ID No. 96 

Example 2S SEQ ID No. 98 

Example 29 SEQ ID No. 100 
Then, ethyl (S)-4-chloro-3-hydroxybutyrate (5.00 g, 30.1 mmol) was added via syringe. The 
automatic titrater maintained the pH at 7 by the addition of 4M NaCN. The progress of the 
reactions were monitored by recording the cumulative volume of the NaCN solution added 
vs. time. 

Figure 4 shows the percent conversion of ethyl (S)-4-chloro-3-hydroxybutyrate 
(calculated from the cumulative equivalents of NaCN added) vs. time for each of these 
Examples. Example 25 used a halohydrin dehalogenase having the amino acid sequence 
SEQ ID NO. 32, which it the amino acid sequence of the native halohydrin dehalogenase 
from Agrobacteriwn radiobacter AD1 {hheQ, expressed from novel nucleic acid 
corresponding to SEQ IP NO: 31. Comparison of the percent conversion vs. time for 
Examples 26 through 29 to that of Example 25 shows that novel halohydrin dehalogenases of 
the present invention have greater activity than the native halohydrin dehalogenase from 
Agrobacteriwn radiobacter AD1 (hheQ. 
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Example 30:_, Preparation of Ethvl f R)-4-cvano-3-hydroxvbutvrate from Ethyl fS)-4-chlorn- 
3-hvdroxvbutvrate 

The procedure was identical to that of Examples 25-29 with the exception that 
halohydrin dehalogenase SEQ ID NO. 104 was used. After 12 hours, the reaction was 
complete based the addition of the NaCN solution, and GC analysis showed that conversion 
of the ethyl (S)-4-chloro-3-hydroxybutyrate to ethyl (R)-4-cyano-3-hydroxybutyrate'was 
>99%. The reaction mixture was cooled and combined with five other similarly produced 
final reaction solutions in a 500 mL three-neck flask. The flask was equipped with a 
condenser and nitrogen dip tube extending into the liquid, and was connected to a diaphragm 
pump to apply vacuum (500 mmHg). A caustic trap was used to trap the HCN in the off gas. 
The reaction mixture was heated to 50 °C and a nitrogen bleed was used to facilitate the 
removal of HCN. After three hours, the HCN removal was complete. The mixture was 
cooled and partitioned into centrifuge bottles. 300 mL of butyl acetate was partitioned 
among the bottles, which were capped and shaken for extraction. The mixtures were 
centrifuged at 8000 rpm for 30 minutes. The layers were separated and the aqueous phase 
was likewise extracted two more times with 300 mL of butyl acetate. The combined butyl 
acetate extracts were evaporated under vacuum to give 19.6 g of ethyl R-4-cyano-3- 
hydroxybutrate (83% yield) as a pale yellow liquid. 

Example 3 1 : Preparation of Ethvl rRV 4-cvano-3-h vdroxvbutvrate from Ethvl fS V4-chloro- 
3-hvdroxvbutvrate. 

To a 170mL vessel connected to an automatic titrater by a pH electrode and a feeding 
tube for addition of base was charged NaCN (1.5 g, 31 mmol) and water (50 mL). The vessel 
was sealed and the pH was adjusted to 7 by the addition of cone. H 2 S0 4 (0.9 mL). The 
reaction mixture was heated to 40 °C and treated with a solution of halohydrin dehalogenase 
SEQ ID NO. 100 (0.4 g in 10 mL water). Then, ethyl (S)-4-chloro-3-hydroxybutyrate (10.0 
g, 60.2 mmol) was added via syringe. The automatic titrater maintained the pH at 7 by the 
addition of 4M NaCN. After 1 8 hours, the reaction was complete based the addition of the 
NaCN solution (no more based being added, 14.6 mL cumulatively added) and GC analysis 
showed that conversion of the ethyl (S)-4-chloro-3-hydroxybutyrate to ethyl (R)-4-cyano-3- 
hydroxybutyrate was >99%. The reaction mixture was cooled and transferred to a 250 mL 
three-neck flask. The flask was equipped with a condenser and nitrogen dip tube extending 
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into the liquid, and was connected to a diaphragm pump to apply vacuum (500 mmHg). A 
caustic trap was used to trap the HCN in the off gas. The reaction mixture was heated to 50 
°C and a nitrogen bleed was used to facilitate the removal of HCN. After three hours 3 the 
HCN removal was complete. The mixture was cooled and transferred to a centrifuge bottle. 
5 60 mL of butyl acetate was added to the bottle, which was capped and shaken for extraction. 
The mixture was centrifiiged at 10,000 rpm for 30 minutes. The layers were separated and 
the aqueous phase was likewise extracted two more times with 60 mL of butyl acetate. The 
combined butyl acetate extracts were evaporated under vacuum to give 7.7 g of ethyl 
R-4-cyano-3-hydroxybutrate (81% yield). 

10 

Examples 32-36: . Preparations of t-Butyl (Ry6-cyano-5-hvdroxy-3-oxohexanoate from 
t-Butyl fRy6-chloro-5-hvdroxv-3-oxohexanoate. 

An aqueous solution containing 25 mg/mL sodium cyanide at pH 7.2 was prepared at 
by dissolving 1 .25 g NaCN and 4.2 g Na^PCU in 50 mL of water. For each of Examples 

15 32-36, 1 mL of the solution was charged to a 5 mL screw cap vial. Halohydrin dehalogenase 
powder (20 mg) was added to the vial, followed by 10 mg te^butyl-(S)-6-chloro-5-hydroxy- 
3-oxo-hexanoate (Julich Fine Chemicals). The vial was capped and the reaction was stirred 
at room temperature for 12 hours. The reaction mixture was then extracted with 1 mL MTBE 
and the organic layer was analyzed by HPLC. In each Example, the fer f-butyl rRV6-chloro- 

20 5-hydroxy-3-oxohexanoate was completely reacted. T he polypeptide sequence of the 

halohydrin dehalogenase used for each Examples and the analyzed reaction yield of fe rt-butyl 
fRV6-cyano-5-hydroxv-3-oxohexanoate were as follows: 



Example 32 


SEQIDNo.106 


25% 


Example 33 


SEQ ID No. 103 


15% 


Example .34 


SEQ ID No. 32 


15% 


Example 35 


SEQ ID No. 74 


10% 


Example 36 


SEQ ID No. 110 


10% 
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. Exam P les37 ' 41: Preparations of t- Butyl f3j?.5£V6-cvano-3S.H,-h^r OX vhexanoate frn™ 

t-ButvI('3^.5^-6-c hlorQ-3.5-dihvdroxv-3-h P .v a no a ^ 

An aqueous solution containing 28 mg/mL sodium cyanide at pH 6.7 was prepared at 
by dissolving 1 .4 g NaCN and 6 g NaH 2 P0 4 in 50 mL of water. For each of Examples 37- 
5 41 , 1 mL of the solution was charged to a 5 mL screw cap vial. Halohydrin dehalogenase 
powder (20 mg) was added to the vial, followed by 20 mg terr-butyl (3£,5.S)-6-chloro- 
3,5-dihydroxyhexanoate fChemistrv- A Europ ean Tnnmsi (2001), 7(21), 4562-4571). The 
vial was capped and the reaction was stirred at room temperature for 1 7 hours. The reaction 
mixture was then extracted with'l mL ethyl acetate and the organic layer was analyzed by 
1 0 HPLC. In each Example, only t-butyl (3i?,5i?)-6-cyano-3,5-dihydi-oxyhexanoate-and 
unreacted t-butyl (3iU^-6-cMoro-3,5-dihydroxy-3-hexanoate were detected_The 
polypeptide sequence of the halohydrin dehalogenase used for each Examples and the 
reaction yield of t-butyl (3J?,5i?)-6-cyano-3,5-dihydroxyhexanoate were as follows: 

Example 37 SEQ ID No. 100 S3% 
15 Example 38 SEQ ID No. 108 65% 

Example 39 SEQ ID No. 32- '64% 

Example 40 SEQ ID No! 74 63% 

Example 41 SEQ ID No. 102 81% 
In a control reaction, omitting halohydrin dehalogenase, the conversion was 4%. 



20 



25 



All publications, patents, patent applications, and other documents cited in this 
application are hereby incorporated by reference in their entirety for all purposes to the same 
extent as if each individual publication, patent, patent application, or other document were . 
individually indicated to be incorporated by reference for all purposes. 

While preferred embodiments of the invention have been illustrated and described, it 
will be appreciated that various changes can be made therein without departing from the spirit 
and scope of the invention. 
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We claim: 

1 . A method for producing a vicinal cyano, hydroxy substituted carboxylic acid 
ester from a vicinal halo, hydroxy substituted carboxylic acid ester, the method comprising: 

(a) providing a vicinal halo, hydroxy substituted carboxylic acid ester, 

5 wherein the halo substituent is selected from the group consisting of chlorine, 

bromine, and iodine, 

and wherein the vicinal halo, hydroxy substituted carboxylic acid ester is not a 
4-halo-3-hydroxybutyric acid ester; and 

(b) contacting the vicinal halo, hydroxy substituted carboxylic acid ester with a 

* 10 halohydrin dehalogenase and cyanide under conditions suitable to form a reaction mixture for 
converting the vicinal halo, hydroxy substituted carboxylic acid ester to a vicinal cyano, 
hydroxy substituted carboxylic acid ester. 

2. The method of claim 1, wherein the vicinal cyano, hydroxy substituted 

15 carboxylic acid ester is a non-racemic chiral vicinal cyano, hydroxy substituted carboxylic 
acid ester. 

3. The method of claim 1 , wherein the cyanide is provided by hydrocyanic acid. 

20 4. The method of claim 1 , wherein the cyanide is provided by a cyanide salt. 

5. The method of claim 1 , wherein the halo substituent of the vicinal halo, 
hydroxy substituted carboxylic acid ester is selected from chlorine and bromine. 

25 6. The method of claim 1, wherein the vicinal halo, hydroxy substituted 

carboxylic acid ester is a vicinal chloro, hydroxy substituted carboxylic acid ester. 

7. The method of claim 1 , wherein the vicinal halo, hydroxy substituted 
carboxylic acid ester is a lower alkyl ester. 
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(1) 



S. The method of claim 1 , wherein 
the vicinal halo, hydroxy substituted carboxylic acid ester has the structure: 

R 2 



OH 




and 

(2) 



the vicinal cyano, hydroxy substituted carboxylic acid ester has the structure: 

OH o 




wherein: 

X is a halogen selected from the group consisting of chlorine, bromine, and iodine; 

R 1 , R 2 , and R> are each independently selected from the group consisting of hvdrogen 
fluonne, an optionally substituted lower alkyl, an optionally substituted cycloalkyl an 
optionally substituted lower alkenyl, an optionally substituted aryl, an optionally substituted 
arylalkyl, amino, an optionally substituted lower alkylamino, an optionally substituted 
cycloalklyamino, an optionally substituted lower alkoxy, an optionally substituted 
cycloalkoxy, an optionally substituted aryloxy, and an optionally substituted arylalkoxy; and 

R a is selected from the group consisting of an optionally substituted lower alkyl an 
optionally substituted cycloalkyl, an optionally substituted aryl, and an optionally substituted 
arylalkyl; and. 

each M n is independently selected from -C(=0> (i.e., carbonyl) or -CR»R»\ wherein 
R* and R- are each independently selected from the group consisting of hydrogen, fluorine 
an optionally substituted lower alkyl, an optionally substituted cycloalkyl, an optionally 
substituted lower alkenyl, an optionally substituted aryl, an optionally substituted arylalkyl 
ammo, an optionally substituted lower alkylamino, an optionally substituted cycloalklvamino 
an optxonally substituted lower alkoxy, an optionally substituted cycloalkoxy, an optionally ' 
substituted aryloxy, an optionally substituted aryl, hydroxyl, nitro, amino, cyano, carboxy 
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(i.e. a carboxylate or carboxylic acid group), carboalkoxy (i.e. an ester group), carbamide 
(i.e.an amide group), and acyl (i.e. forming a ketone); and 
n is zero or an integer from 2 to 9, inclusive. 



9. The method of claim 8, wherein n is an integer from 2 to 8, inclusive. 

10. The method of claim 8, wherein n is 3. 

10 11. The method of claim 1 , wherein the vicinal cyano, hydroxy substituted 

carboxylic acid ester is a 6-halo-5-hydroxyhexanoic acid ester. 



15 



20 



25 



(1) 



and 

(2) 



1 2 . The method of claim 1 , wherein 

the vicinal halo, hydroxy substituted carboxylic acid ester has the structure: 

O 




the vicinal cyano, hydroxy substituted carboxylic acid ester has the structure: 




wherein: 

X is a halogen selected from the group consisting of chlorine, bromine, and iodine; 

R 1 , R 2 , R 3 , R 4 , R 6 , R 1 ', R ] ", R 3 \ and R 3 " are each independently selected from the 
group consisting of hydrogen, fluorine, an optionally substituted lower alkyl, an optionally 
substituted cycloalkyl, an optionally substituted lower alkenyl, an optionally substituted aryl, 
an optionally substituted arylalkyl, amino, an optionally substituted lower alkylamino, an 
optionally substituted cycloalklyamino, an optionally substituted lower alkoxy, an optionally 
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substituted cycloalkoxy, an optionally substituted aryloxy, and an optionally substituted 
arylalkoxy; and 

R 5 is selected from the group consisting of an optionally substituted lower alkyl, an 
optionally substituted cycloalkyl, an optionally substituted aryl, and an optionally substituted 
arylalkyl. 



(1) 



1 3 . The method of claim 1 , wherein 

the vicinal halo, hydroxy substituted carboxylic acid ester has the structure: 




and 



(2) the vicinal cyano, hydroxy substituted carboxylic acid ester has the structure: 




wherein: 

X is a halogen selected from the group consisting of chlorine, bromine, and iodine; 

R\ R 2 , R 3 , R 4 , R 6 , R 1 ', R 1 ", R 2 ' 1 , R 3 ', and R 3 " are each independently selected from the 
group consisting of hydrogen, fluorine, an optionally substituted lower alkyl, an optionally 
substituted cycloalkyl, an optionally substituted lower alkenyl, an optionally substituted aryl, 
an optionally substituted arylalkyl, amino, an optionally substituted lower alkylamino, an 
optionally substituted cycloalklyamino, an optionally substituted lower alkoxy, an optionally 
substituted cycloalkoxy, an optionally substituted aryloxy, and an optionally substituted 
arylalkoxy; and 

R 5 is selected from the group consisting of an optionally substituted lower alkyl, an 
optionally substituted cycloalkyl, an optionally substituted aryl, and an optionally substituted 
arylalkyl. 
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0) 



1 4. The method of claim 1 , wherein 

the vicinal halo, hydroxy substituted carboxylic acid ester has the structure: 

R 2 




and 

(2) the vicinal cyano, hydroxy substituted carboxylic acid ester has the structure: 

OH 




wherein: 

X is a halogen selected from the group consisting of chlorine, bromine, and iodine; 

R , R , and R are each independently selected from the group consisting of hydrogen, 
fluorine, an optionally substituted lower alkyl, an optionally substituted cycloalkyl, an 
optionally substituted lower alkenyl, an optionally substituted aryl, an optionally substituted 
arylalkyl, amino, an optionally substituted lower alkylamino, an optionally substituted 
cycloalklyamino, an optionally substituted lower alkoxy, an optionally substituted 
cycloalkoxy, an optionally substituted aryloxy, and an optionally substituted arylalkoxy; and 

R 5 is selected from the group consisting of an optionally substituted lower alkyl, an 
optionally substituted cycloalkyl, an optionally substituted aryl, and an optionally substituted 
arylalkyl. 



1 5 . The method of claim 1 , wherein the halohydrin dehalogenase is a naturally 
0 occurring halohydrin dehalogenase. 

16. The method of claim 1 , wherein the halohydrin dehalogenase is a non- 
naturally occurring halohydrin dehalogenase. 
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17. The method of claim 1, wherein the reaction mixture for converting the vicinal 
halo, hydroxy substituted carboxylic acid ester to a vicinal cyano, hydroxy substituted 
carboxylic acid ester is maintained at a pH in the range of from about 5 to about 9. 

IS. The method of claim 1 7, wherein the reaction mixture for converting the 
vicinal halo, hydroxy substituted carboxylic acid ester to a vicinal cyano, hydroxy substituted 
carboxylic acid ester is maintained at a pH in the range of from about 5 to about 8. 

19. The method of claim 1, wherein the reaction mixture for converting the vicinal 
halo, hydroxy substituted carboxylic acid ester to a vicinal cyano, hydroxy substituted 
carboxylic acid ester is maintained at a pH of about 8 or below. 

20. The method of claim 1, wherein the reaction mixture for converting the vicinal 
halo, hydroxy substituted carboxylic acid ester to a vicinal cyano, hydroxy substituted . 
carboxylic acid ester further comprises apH buffer. 

21. The method of claim 1 , further comprising: 

(c) adding a base sufficient to maintain the reaction mixture for converting the vicinal 
halo, hydroxy substituted carboxylic acid ester to a vicinal cyano, hydroxy substituted 
carboxylic acid ester at a pH of about 5 or above. 

22. The method of claim 21 wherein the base is selected from hydroxide salts, 
carbonate salts, and bicarbonate salts. 



23. 



The method of claim 2 1 wherein the base is selected from a cyanide salt. 



24. The method of claim 1 , further comprising recovering the vicinal cyano, 
hydroxy substituted carboxylic acid ester from the reaction mixture for converting the vicinal 
halo, hydroxy substituted carboxylic acid ester to a vicinal cyano, hydroxy substituted 
carboxylic acid ester. 
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25. The method of claim 22, further comprising purifying the vicinal cyano, 
hydroxy substituted carboxylic acid ester. 



26. A composition comprising: 
5 (a) a halohydrin dehalogenase; 

(b) cyanide; and 

(c) a vicinal halo, hydroxy substituted carboxylic acid ester, 

wherein the vicinal halo, hydroxy substituted carboxylic acid ester is not a 4-halo-3- 
hydroxybutyric acid ester. 



10 



27. The composition of claim 26, wherein the vicinal halo, hydroxy substituted 
carboxylic acid ester has the structure: 

OH O 




15 wherein:. 

X is a halogen selected from the group consisting of chlorine, bromine, and iodine; 
R 1 , R", and R 6 are each independently selected from the group consisting of hydrogen, 
fluorine, an optionally substituted lower alkyl, an optionally substituted cycloalkyl, an 
optionally substituted lower alkenyl, an optionally substituted aryl, an optionally substituted 
20 arylalkyl, amino, an optionally substituted lower alkylamino, an optionally substituted 
cycloalklyamino, an optionally substituted lower alkoxy, an optionally substituted 
cycloalkoxy, an optionally substituted aryloxy, and an optionally substituted arylalkoxy; and 

R 5 is selected from the group consisting of an optionally substituted lower alkyl, an 
optionally substituted cycloalkyl, an optionally substituted aryl, and an optionally substituted 
25 arylalkyl; and. 

each M n is independently selected from -C(=0> (i.e., carbonyl) or -CR n 'R n "-, wherein 
R n ' and R n " are each independently selected from the group consisting of hydrogen, fluorine, 
an optionally substituted lower alkyl, an optionally substituted cycloalkyl, an optionally 
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substituted lower alkenyl, an optionally substituted aryl, an optionally substituted arylalkyl, 
amino, an optionally substituted lower alkylamino, an optionally substituted cycloalklyamino, 
an optionally substituted lower alkoxy, an optionally substituted cycloalkoxy, an optionally 
substituted aryloxy, an optionally substituted aryl, hydroxy!, nitro, amino, cyano, carboxy 
(i.e. a carboxylate or carboxylic acid group), carboalkoxy (i.e. an ester group), carbamide 
(i.e.an amide group), and acyl (i.e. forming a ketone); and 
n is zero or an integer from 2 to 9, inclusive. 
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SEQUENCE LISTING 

<110> Davis, S. Christopher- 
Grate, John H. 
Gray, David R. 
Gruber, John M. 
Huisman, Gjalt W. 
Ma, Steven K. 

Newman, Lisa M. Sheldon, Roger 
Wang, Li A. 



<12 0> Enzymatic Processes for the Production 

of 4-Substituted 3-Hydroxybutyric Acid Derivatives and 
Vicinal Cyano, Hydroxy- substituted Carboxylic Acid Esters 



<130> 0339.310WO 

<150> US Not yet assigned 
<151> 2004-02-18 

<150> US 10/639,159 
<151> 2003-08-11 

<150> US 60/402,436 
<151> 2002-08-09 

<160> 126 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 852 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ketoreductase 1 
KRED CR2-5 

<221> CDS 

<222> (1) . . . (852) 

<400>* 1 

atg gca aag aat ttt age aat gta gag tat ccc gca ccc ccc ccc gca 48 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Prp Ala Pro Pro Pro Ala 
1' 5-10 is 

cat aca aag aat gag age tta caa gta tta gat tta ttt aag tta aat 96 
His Thr Lys Asn Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn ' 
20 25 " 30 

gga aaa gta gca age ata aca gga age age age gga ata gga tat gca 144 
Gly Lys Val Ala Ser He Thr Gly Ser Ser Ser Gly He Gly Tyr Ala 
35 40 45 
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tta gca gag get ttt gca caa gtc gga gca gat gta gca ata tgg tat 192 
Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala lie Trp Tyr 
5 0 55 ~ 60 

aat age cat gat gca aca gga aaa gca gag gca tta gca aag aag tat 240 
Asn Ser His Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 
65 70 75 80 

gga gta aag gta aag gca tat aaa gca aat gta age age age gat gca 288 
Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 
85 90 95 

gtc aag caa aca ata gag caa caa ata aag gat ttt gga cat tta gat 336 
Val Lys Gin Thr lie Glu Gin Gin lie Lys Asp Phe Gly His Leu Asp 
100 105 110 

ata gta gta gca aat gca gga ata ccc tgg aca aag gga gca tat ata 3 84 

He Val Val Ala Asn Ala Gly He Pro Trp Thr Lys Gly Ala Tyr He 
115 120 ~ 125 

gat caa gat gat gac aag cat ttt gac caa gta gta gat gta gac tta 432 
Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Asp Leu 
130 " * 135 140 

aag gga gta gga tac gta gca aag cat gca gga agg cat ttt agg gaa 480 
Lys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Phe Arg Glu 
145 150 155 160 

agg ttt gag aaa gag gga aaa aag gga gca tta gta ttt aca gca age 528 
Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 
165 170 175 

atg age gga cat ata gta aat gtc ccc caa ttc caa gca aca tat aat 576 
Met Ser Gly His He Val Asn Val Pro Gin Phe Gin Ala Thr Tyr Asn 
180 185 190 

gca gca aag gca gga gta agg cat ttt gca aag age tta gca gtc gag 624 
Ala Ala Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 
195 200 205 

ttt gca ccc ttt gca agg gta aat age gta age ccc gga tat ata aat 672 
Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr He Asn 
210 215 220 

aca gag ata age gat ttc gtc ccc caa gag aca caa aat aag tgg tgg 720 
Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 
225 * 230 235 " ~ 240 

age tta gtc ccc tta gga agg gga gga gag aca gca gag tta gta gga 768 
Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 
245 250 255 

gca tat tta ttc tta gca age gat gca gga age tat gca aca gga aca 816 
Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 
260 265 * 270 
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gat ata ata gta gat gga gga tat aca tta ccc taa 
Asp lie lie Val Asp Gly Gly Tyr Thr Leu Pro * 
275 280 



852 



<210> 2 
<211> 283 
<212> PRT 

<213> Candida magnoliae 
<400> 2 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala 

15 10 is 

His Thr Lys Asn Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 

20 25 30 

Gly Lys Val Ala Ser He Thr Gly Ser Ser Ser Gly He Gly Tyr Ala 

35 40 45 

Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala He Trp Tyr 

.50 55 " 60 

Asn Ser His Asp Ala Thr Gly Lys Ala .Glu Ala Leu Ala Lys Lys Tyr 
65 70 75 * 80 

Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 

85 -90 95 

Val Lys Gin Thr He Glu Gin Gin lie Lys Asp Phe Gly His Leu Asp 

100 105 no 

"He Val Val Ala Asn Ala Gly lie Pro Trp Thr Lys Gly Ala Tyr He 

115 120 " 125 

Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Asp Leu 

130 135 140 

Lys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Phe Arg Glu 
145 150 155 " 160 

Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 

165 170 175 . 

Met Ser Gly His He Val Asn Val Pro Gin Phe Gin Ala Thr Tyr Asn 

180 185 . 190 

Ala Ala Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 

195 200 " 205 

Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr He Asn 

210 215 220 

Thr Glu lie Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 
225 230 235 " 240 

Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 

245 250 255 

Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 

260 265 270 

Asp He He Val Asp Gly Gly Tyr Thr Leu Pro 
275 230 



<210> 3 

<211> 852 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ketoreductase 2 
KRED CR1-2 
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<22l> CDS 

<222> (1) . . . (852) 

<400> 3 



atg gca aag aat ttt age aat gta gag tat ccc gca ccc ccc ccc qca 
Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala 



1 5 



10 , is 



cat aca aag aat gag age tta caa gta tta gat tta ttt aag tta aat 
His Thr Lys Asn Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 
20 25 



30 



35 40 



45 



60 



aat age cat gat gca aca gga aaa gca gag gca tta gca aag aag tat 
Asn Ser His Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tvr 
65 70 75 80 

gga gta aag gta aag gca tat aaa gca aat gta age age age gat gca 
Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 
85 90 , 95 



110 



aag gga gta gga tac gta gca aag cat gca gga agq cat ttt agg gaa 
Lys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Phe Arg Glu 
145 150 15* 



160 



atg age gga cat ata gta aat gtc ccc caa ttc caa gca aca tat aat 
Met Ser Gly His He Val Asn Val Pro Gin Phe Gin Ala Thr Tyr Asn 
180 185 190 



ttt gca ccc ttt gca agg gta aat age gta age ccc gga tat ata aat 



48 



96 



gga aaa gta gca age ata aca gga age age age gga ata gga tat qca 144 
Gly Lys Val Ala Ser He Thr Gly Ser Ser Ser Gly He Gly Tyr Ala 



tta gca gag get ttt gca caa gtc gga gca gat gta gca. ata tgg tat 192 
Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala lie Trp Tyr 
50 55 ' 



240 



283 



gtc aag caa aca ata gag caa caa ata aag gat ttt gga cat tta gat 336 
Val Lys Gin Thr He Glu Gin Gin He Lys As P Phe Gly His Leu Asp 
100 105 " " ~ 



ata gta gca gca aat gca gga ata ccc tgg aca aag gga gca tat ata 3 84 

lie Val Ala Ala Asn Ala Gly He Pro Trp Thr Lys Gly Ala Tyr lie 
115 120 i 2 5 

gat caa gat gat gac aag cat ttt gac caa gta gta gat gta gac tta 432 
Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val As P Val £sp Leu 
13 ° 135 140 



480 



agg ttt gag aaa gag gga aaa aag gga gca tta gta ttt aca gca age 5?8 
Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 
165 170 175 



576 



gca gca aag gca gga gta agg cat ttt gca aag age tta gca gtc gag $->a 
Ala Ala Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val -Glu 
195 200 205 



S11 



4 



WO 2005/018579 



PCT/US2004/004819 



Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr He Asn 
210 215 220 

aca gag ata age gat ttc gtc ccc caa gag aca caa aat aag tgg tgg 720 
Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 
225 ~ 230 235 240 

age tta gtc ccc tta gga agg gga gga gag aca gca gag tta gta gga 7 69 

Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 
245 * 250 255 

gca tat tta ttc tta gca age gat gca gga age tat gca aca gga aca , 816 
Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 
260 265 . 270 

gat ata ata gta gat gga gga tat aca tta ccc taa 852 
Asp He He Val Asp Gly Gly Tyr Thr Leu Pro * 
275 280 



<210> 4 

<211> 283 

<212> PRT 

<213> Candida magnoliae 



<400> .4 



Met 


Ala 


Lys 


Asn 


Phe Ser Asn 


Val 


Glu Tyr 


Pro 


Ala 


Pro 


Pro 


Pro 


Ala 


1 






5 




10 










15 




His 


Thr 


Lys 


Asn 


Glu Ser Leu 


Gin 


Val Leu 


Asp 


Leu 


Phe 


Lys 


Leu 


Asn 






20 






25 








30 






Gly 


Lys 


Val 


Ala 


Ser He Thr 


Gly 


Ser Ser 


Ser 


Gly 


He 


Gly 


Tyr 


Ala 


35 






40 








45 








Leu 


Ala 


Glu 


Ala 


Phe Ala Gin 


Val 


Gly Ala 


Asp 


Val 


Ala 


He 


Trp 


Tyr 




50 






55 








60 










Asn 


•Ser 


His 


Asp 


Ala Thr Gly 


Lys 


Ala Glu 


Ala 


Leu 


Ala 


Lys 


Lys 


Tyr 


65 






70 






75 










80 


Gly 


Val 


Lys 


Val 


Lys Ala Tyr 


Lys 


Ala Asn 


Val 


Ser 


Ser 


Ser 


Asp 


Ala 








85 




90 










95 




Val 


Lys 


Gin 


Thr 


He Glu Gin 


Gin 


He Lys 


Asp 


Phe 


Gly* 


His 


Leu 


Asp 






100 






105 








110 






He 


Val 


Ala 


Ala 


Asn Ala Gly 


He 


Pro Trp 


Thr 


Lys 


Gly 


Ala 


Tyr 


He 






115 






120 








125 








Asp 


Gin 


Asp 


Asp 


Asp Lys His 


Phe 


Asp Gin 


Val 


Val 


Asp 


Val 


Asp 


Leu 




130 






135 








140 










Lys 


Gly 


Val 


Gly 


Tyr Val Ala 


Lys 


His Ala 


Gly 


Arg 


His 


Phe 


Arg 


Glu 


145 






150 






155 










160 


Arg 


Phe 


Glu 


Lys 


Glu Gly Lys 


Lys 


Gly Ala 


Leu 


Val 


Phe 


Thr 


Ala 


Ser 






165 




170 










175 




Met 


Ser 


Gly 


His 


He Val Asn 


Val 


Pro Gin 


Phe 


Gin 


Ala 


Thr 


Tyr 


Asn 






180 






185 








190 






Ala 


Ala 


Lys 


Ala 


Gly Val Arg 


His 


Phe Ala 


Lys 


Ser 


Leu 


Ala 


Val 


Glu 






195 






200 








205 








Phe 


Ala 


Pro 


Phe 


Ala Arg Val 


Asn 


Ser Val 


Ser 


Pro 


Gly 


Tyr 


He 


■Asn 




210 






215 








220 










Thr 


Glu 


He 


Ser 


, Asp Phe Val 


Pro 


Gin Glu 


Thr 


Gin 


Asn 


Lys 


Trp 


Trp 


225 








230'. 






235 










240 


Ser 


Leu 


Val 


Pro 


Leu Gly Arg 


Gly 


Gly Glu 


Thr 


Ala 


Glu 


Leu 


Val 


Gly 
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245 250 -55 

Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 

250 265 " 970 

Asp He. He Val Asp Gly Gly Tyr Thr Leu Pro 
275 230 

<210> 5 
<211> 852 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ketoreductase 3 
KRED CR1-3 

<221> CDS 

<222> (1) . . . (852) 

<400> 5 

atg gca aag aat ttt age aat gtg gag tat ccc gca ccc ccc ccc qca 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ila Pro Pro Pro Ala 

5 10 15 

cat aca aag aat gag age tta caa gta tta gat tta ttt aaa tta 
His Thr Lys Asn Glu Ser Leu Gin Val Leu fsp Leu Phe Lvs Leu Asn 
20 2 * io 

gga aaa gta gca age ata aca gga aqc acre aoc aaa t-„*- 
Gly Lys Val Ala Ser lie Thr £y sir Ser sir Ify Tie lly Tyr S 
35 40 45 

T tta !?* !f 9 9Ct tfct 9Ca Caa gtc " a aca 9 at 9 fc a gca ata tgg tat 
Leu Ala- Glu Ala Phe Ala Gin Val Gly Ala Asp Val La lie Tr| Tyr 
b0 -55 60 

aat age cat gat gca aca gga aaa gca gag gca tta gca aag aag. tat 
Asn Ser H iS Asp Ala Thr Gly Lys Ala Glu Ala Leu Ila Lys Lyl Tyr 
65 70 75 £ 0 

gga gta aag gta aag gca tat aaa gca aat gta age age age qat aca 
Gly val Lvs Val Lys Ala Tyr Lys Ala Asn Val sir Ser sir fsp Sa 
85 5° 95 ■ 

gtc aag caa aca ata gag caa caa ata aaq gat ttt qqa cat tta aat- 
Val Lys Gin Thr lie Glu Gin Gin lie Lys Lp Phe lly His Leu Sp 
100 105 " 



110 



ata gta gta gca aat gca gga ata ccc tgg aca aag qqa aca tat ata 
He Val Val Ala Asn Ala Gly He Pro Tr? Thr Lys gj L a ' T yr lie 
115 120 



125 



gat caa gat gat gac aag cat ttt gac caa gta gta gat gta gac tta 
Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Isp Leu 

135 140 

aag gga gta gga tac gta gca aag cat gca gga agg cat ttt aaa aaa 
Lys Gly val Gly Tyr Val Ala Lys His Ala Gly A?g His Pne Sg I" 



48 



96 



144 



192 



240 



238 



336 



384 



432 



480 
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145 150 155 160 

agg ttt gag aaa gag gga aaa aag gga gca tta gta ttt aca gca age 528 
Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 
165 170 175 

atg age gga cat ata gta aat gtc ccc caa ttc caa gca aca tat aat 576 
Met Ser Gly His lie Val Asn Val Pro Gin Phe Gin Ala Thr Tyr Asn 
180 135 190 

gca gca aag gca gga gta agg cat ttt gca aag age tta gca gtc gag 624 
Ala Ala Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 
195 200 205 

ttt gca ccc ttt gca agg gta aat age gta age ccc gga tat ata aat 672 
Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr He Asn 
210 215 220 



aca gag ata age gat ttc gtc ccc caa gag aca caa aat aag tgg tgg 
Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 
25 " 230 235 240 



720 



768 



age tta gtc ccc tta gga agg gga gga gag aca gca gag tta gta gga 

Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 
245 250 255 

gca tat tta ttc tta gca age gat gca gga age tat gca aca gga aca 816 

Ala Tyr Leu Phe Leu Ala Ser' Asp Ala Gly Ser Tyr Ala Thr Gly Thr 
260 265 270 

gat ata ata gta gat gga gga tat aca tta ccc taa 852 

Asp He lie Val Asp Gly Gly Tyr Thr Leu Pro * 
275 280 



<210> 6 
<211> 283 
<212> PRT 

<213> Candida magnoliae 
<400> 6 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala 

1 5 10 15 

His Thr Lys Asn Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 

20 25 30 

Gly Lys Val Ala Ser He Thr Gly Ser Ser Ser Gly He Gly Tyr Ala 

35 40 45 

Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala He Trp Tyr 

50 55 60 

Asn Ser His Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 
65 70 75 80 

Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 

85 ^90 95 

Val Lys Gin Thr He Glu Gin Gin lie Lys Asp Phe Gly His Leu Asp 

100 105 110 . 

He Val Val Ala Asn Ala Gly He Pro Trp Thr Lys Gly Ala Tyr lie 
115 120 125 
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Asp 


Gin 


Asp 


Asp 


Asp Lys 


His 


Phe 




130 








135 




Lys 


Gly Val 


Gly 


Tyr Val 


Ala 


Lys 


145 








150 






Arg 


Phe 


Glu 


Lys 


Glu Gly 


Lys 


Lys 










165 






Met 


Ser Gly 


His 


He Val 


Asn 


Val 








180 








Ala 


Ala 


Lys 


Ala 


Gly Val 


Arg 


His 






195 








200 


Phe 


Ala 


Pro 


Phe 


Ala Arg 


Val 


Asn 




210 








215 




Thr 


Glu 


He 


Ser 


Asp Phe 


Val 


• Pro 


225 








230 






Ser 


Leu 


Val 


Pro 


Leu Gly 


Arg 


Gly 










245 






Ala 


Tyr 


Leu 


Phe 


Leu Ala 


Ser 


Asp 








260 








Asp 


lie 


He 


Val 


Asp Gly 


Gly Tyr 






275 








280 



Asp Gin Val Val Asp Val Asp Leu 
140 

His Ala Gly Arg His Phe Arg Glu 
155 160 
Gly Ala Leu Val Phe Thr Ala Ser 

170 175 
Pro Gin Phe Gin Ala Thr Tyr Asn 
185 190 
Phe Ala Lys Ser Leu Ala Val Glu 
205 

Ser Val Ser Pro Gly Tyr He Asn 
220 

Gin Glu Thr Gin Asn Lys Trp Trp 
235 ~ 240 

Gly Glu Thr Ala Glu Leu Val Gly 

250 255 
(Ala Gly Ser Tyr Ala Thr Gly Thr 
265 270 
Thr Leu Pro 



<210> 7 

<211> 852 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ketoreductase 4 
KRED CR2-4 

<221> CDS 

<222> (1) . . . (852) 

<400> 7 

atg gca aag aat ttt age aat gta gag tat ccc gca ccc ccc ccc gca 4 8 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala 
1 5 io is 

cat aca aag aat gag age tta caa gta tta gat tta ttt aag tta aat 96 
His Thr Lys Asn Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 
20 25 " 30 

gga aaa gta gca age ata aca gga age age age gga ata gga tat gca 144 
Gly Lys Val Ala Ser He Thr Gly Ser Ser Ser Gly He Gly Tyr Ala 
35 40 45 

tta gca gag get ttt gca caa gtc gga gca gat gta gca ata tgg tat 192 
Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala He Trp Tyr 
50 55 60 

aat age cat gat gca aca gga aaa gca gag gca tta gca aag aag tat 240 
Asn Ser His Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 
65 70 75 ^ " 80 

gga gta aag gta aag gca tat aaa gca aat gta age age age gat gca 288 
Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 
85 90 95 
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gtc aag caa aca ata gag caa caa ata aag gat ttt gga cat tta gat 336 
Val Lys Gin Thr lie Glu Gin Gin He Lys Asp Phe Gly His Leu Asp 
100 105 110 

ata gta gta gca aat gca gga ata ccc tgg aca aag gga gca tat ata 3 84 

He Val Val Ala Asn Ala Gly He Pro Trp Thr Lys Gly Ala Tyr He 
115 120 125 

gat caa gat gat gac aag cat ttt gac caa gta gta gat .gta gac tta • 432 
Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Asp Leu 
130 ' " 135 140 

aag gga gta gga tac gta gca aag' cat gca gga agg cat ttt agg gaa 430 
Lys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Phe Arg Glu 
145 150 155 160 

agg ttt gag aaa gag gga aaa-aag gga gca tta gta ttt aca gca age 528 
Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 
165 170 175 

atg age gga cat ata gta aat gtc ccc caa ttc caa gca aca tat aat 576 
Met Ser Gly His He Val Asn Val Pro Gin Phe Gin Ala Thr Tyr Asn 
180 185 190 

gca gca aag gca gga gta agg cat ttt gca aag age tta gca gtc gag 624 
Ala Ala Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 
195 * 200 205 



ttt gca ccc ttt gca agg gta aat age gta age ccc gga tat ata aat 
Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr He Asn 
210 • 215 220 

aca gag ata age gat ttc gtc ccc caa gag aca caa aat aag tgg tgg 
Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 
225 ' 230 235 240 



gca tat tta ttc tta gca age gat gca gga age tat gca aca gga aca 

Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 
260 265 270 

gat ata ata gta gat gga gga tat act tta ccc taa 

Asp He He Val Asp Gly Gly Tyr Thr Leu Pro * 
275 280 



<210> 8 
<211> 283 
<212> PRT 

<213> Candida magnoliae 
<400> 8 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala 



672 



720 



age tta gtc ccc tta gga agg gga gga gag aca gca gag tta gta gga 768 
Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 
245 250 255 



816 



852 
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1 




His Thr 


Lys Asn 




20 


Gly Lys 


Val Ala 




35 


Leu Ala 


Glu Ala 


50 ' 




Asn Ser 


His Asp 


65 




Gly Val 


Tivs Val 

jj y o v ax 


Val Lys 


Gin Thr 




100 


He Val 


Val Ala 




115 


Asp Gin 


Asp Asp 


130 




Lys Gly 


Val Gly 


145 




Arg Phe 


Glu Lys 


Met Ser 


Gly His 




180 


Ala Ala 


Lys Ala 




195 


Phe Ala 


Pro Phe 


210 




Thr Glu 


He Ser 


225 




Ser Leu 


Val Pro 


Ala Tyr 


Leu Phe 




260 


Asp He 


He Val 




275 



GlU 


Ser Leu 


Gin 


Ser 


lie Thr 


Gly 






40 


Phe 


Ala Gin 


Val 




55 




Ala 


Thr Gly 


Lys 




70 




Lys 


Ala Tyr 


Lys 


QC 
O 3 






He 


Glu Gin 


Gin 


Asn 


Ala Gly 


He 






120 


Asp 


Lys His 


Phe 




135 




Tyr 


Val Ala 


Lys 




150 




Glu 


Gly Lys 


Lys 


lo b 






He 


Val Asn 


Val 


Gly 


Val Arg 


His 






200 


Ala 


Arg Val 


Asn 




215 




Asp 


Phe Val 


Pro 




230 




Leu 


Gly Arg Gly 


245 






Leu 


Ala Ser 


Asp 


Asp 


Gly Gly Tyr 






230 



10 

Val Leu Asp Leu 
25 

Ser Ser Ser Gly 

Gly Ala Asp Val 
60 

Ala Glu Ala Leu 
75 

Ala Asn Val Ser 
90 

He Lys Asp Phe 
105 

Pro Trp Thr Lys 

Asp Gin Val Val 
140 

His Ala Gly Arg 
155 

Gly Ala Leu Val 
170 

Pro Gin Phe Gin 
185 

Phe Ala Lys Ser 

Ser Val Ser Pro 
220 

Gin Glu Thr Gin 
235 

Gly Glu Thr Ala 
250 

Ala Gly Ser Tyr 
265 

Thr Leu Pro 



15 

Phe Lys Leu Asn 
30 

He Gly Tyr Ala 
45 

Ala He Trp Tyr 

Ala Lys Lys Tyr 
80 

Ser Ser Asp Ala 
95 

Gly His Leu Asp 
110 

Gly Ala Tyr He 
125 

Asp Val Asp Leu 

His Phe Arg Glu 
160 

Phe Thr Ala Ser 
175 

Ala Thr Tyr Asn 
190 

Leu Ala Val Glu 
205 

Gly Tyr He Asn 

Asn Lys Trp Trp 
240 

Glu Leu Val Gly 
255 

Ala Thr Gly Thr 
270 



<210> 9 

<211> 786 

<212> DNA 

<213> Bacillus sp. 



<220> 

<221> CDS 

<222> {!)... (786) 

<223> Glucose dehydrogenase S06-3 



<400> 9 

atg tat ccg gat tta aaa gga aaa gtc gtc get att aca gga get get 48 
Met Tyr Pro Asp Leu Lys Gly Lys Val Val Ala He Thr Gly Ala Ala 
1^5 10 15 

tea ggg etc gga aag gcg atg gec att cgc ttc ggc aag gag cag gca 96 
Ser Gly Leu Gly Lys Ala Met Ala He Arg Phe Gly Lys Glu Gin Ala 
20 25 30 

aaa gtg gtt ate aac tat tat agt aat aaa caa gat ccg aac gag gta 144 
Lys Val Val He Asn Tyr Tyr Ser Asn Lys Gin Asp Pro Asn Glu Val 
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35 40 45 

aaa gaa gag gtc ate aag gcg ggc ggt gaa get gtt gtc gtc caa gga 192 
Lys Glu Glu Val He Lya Ala Gly Gly Glu Ala Val Val Val Gin Gly 
50 55 60 

gat gtc acg aaa gag gaa gat gta aaa aat ate gtg caa acg gca att 240 
Asp Val Thr Lys Glu Glu Asp Val Lys Asn He Val Gin Thr Ala He 
65 70 75 80 

aag gag ttc ggc aca etc gat att atg att aat aat gee ggt ctt gaa 288 
Lys Glu Phe Gly Thr Leu Asp He Met He Asn Asn Ala Gly Leu Glu 
85 90 ' 95 

aat cct gtg cca tct cac gaa atg ccg etc aag gat tgg gat aaa gtc 336 
Asn Pro Val Pro Ser His Glu Met Pro Leu Lys Asp Trp Asp Lys Val 
100 105 " no 

ate ggc acg aac tta acg ggt.gcc'ttt tta gga age cgt gaa gcg att 384. 
He Gly Thr Asn Leu Thr Gly Ala Phe Leu Gly Ser Arg Glu Ala He 
115 120 . " 125 

aaa tat ttc gta gaa aac gat ate aag gga aat gtc att aac atg tec 432 
Lys Tyr Phe Val Glu Asn Asp lie Lys Gly Asn Val He Asn Met Ser 
130 135 140 

agt gtg cac gaa gtg att cct tgg ccg tta ttt gtc cac tat gcg gca 480 
Ser Val His Glu Val He Pro Trp Pro Leu Phe Val His Tyr Ala Ala 
' 145 ' 150 155 * 160 

agt aaa. ggc ggg ata aag ctg atg aca gaa aca tta gcg ttg gaa tac 528 
Ser Lys Gly Gly He Lys Leu Met Thr Glu Thr Leu Ala Leu Glu Tyr 
165 170 175 

gcg ccg aag ggc att cgc gtc aat aat att ggg cca ggt gcg ate aac 576 
Ala Pro Lys Gly He Arg Val Asn Asn He Gly Pro Gly Ala He Asn 
180 185' 190 

acg cca ate aat get gaa aaa ttc get gac cct aaa cag aaa get gat 624 
Thr Pro He Asn Ala Glu Lys Phe Ala Asp Pro Lys Gin Lys Ala Asp 
195 200 205 

gta gaa age atg att cca atg gga tat ate ggc gaa ccg gag gag ate 672 
Val Glu Ser Met He Pro Met Gly Tyr He Gly Glu Pro Glu Glu He 
210 215 220 

gee gca gta gca gec tgg ctt get teg aag gaa gee age tac gtc aca 720 
Ala Ala Val Ala Ala Trp Leu Ala Ser Lys Glu Ala Ser Tyr Val Thr 
225 230 235 ~ 240 

ggc ate acg tta ttc gcg gac ggc ggt atg aca caa tat cct tea ttc 768 
Gly He Thr Leu Phe Ala Asp Gly Gly Met Thr Gin Tyr Pro Ser Phe 
^45 250 4 255 

cag gca ggc cgc ggt taa 736 
Gin Ala Gly Arg Gly * 
260 
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<210> 10 

<2U> 261 

<212> PRT 

<213> Bacillus sp . 



<400>- 10 

Met Tyr Pro Asp Leu Lys Gly Lys Val Val 

1 5 10 

Ser Gly Leu Gly Lys Ala Met Ala He Arg 

20 25 
Lys Val Val He Asn Tyr Tyr Ser Asn Lys 

35 40 
Lys Glu Glu Val He Lys Ala Gly Gly Glu 

50 55 
Asp Val Thr Lys Glu Glu Asp Val Lys Asn 
65 70 

Lys Glu Phe Gly Thr Leu Asp He Met He 

B5 90 
Asn Pro Val Pro Ser His Glu Met Pro Leu 

■ 100 105 
He Gly Thr Asn Leu Thr Gly Ala Phe Leu 

115 120 
Lys Tyr Phe Val Glu Asn Asp lie Lys Gly 

130 135 
Ser Val His Glu Val He Pro Trp Pro Leu 
145 150 

•Ser Lys Gly Gly He Lys Leu Me.t Thr Glu 
165 170 
Ala Pro Lys Gly lie Arg Val Asn Asn lie 

ISO 185 
Thr Pro ile Asn Ala Glu Lys Phe Ala Asp 

195 200 
Val Glu Ser Met lie Pro Met Gly Tyr Ile 

210 215 
Ala Ala Val Ala Ala Trp Leu Ala Ser Lys 
225 230 

Gly lie Thr Leu Phe Ala Asp Gly Gly Met 
-245 250 
Gin Ala Gly Arg Gly 
260 



Ala He Thr Gly 
Phe Gly 
Gin Asp 



Ala Val 

60 
He Val 
75 

Asn Asn 

Lys Asp 

Gly Ser 

Asn Val 
140 
Phe Val 
155 

Thr Leu 

Gly. Pro 

Pro Lys 

Gly Glu 
220 
Glu Ala 
235 

Thr Gin 



Lys Glu 

30 
Pro Asn 
45 

Val Val 



Gin Thr 

Ala Gly 

Trp Asp 
110 
Arg Glu 
125 

Ile Asn 

His Tyr 

Ala Leu 

Gly Ala 
190 
Gin Lys 
205 

Pro Glu 
Ser Tyr 
Tyr Pro 



Ala Ala 
15 

Gin Ala 

Glu Val 

Gin Gly 

Ala lie 
80 

Leu Glu 
95 

Lys Val 

Ala Ile 

Met Ser 

Ala Ala 
160 
Glu Tyr 
175 

lie Asn 

Ala Asp 

Glu lie 

Val Thr 
240 
Ser Phe 
255 



<210> 11 

<211> 786 

<212> DNA 

<213> Bacillus sp. 



<220> 

<221> CDS 

<222> (1) . . . (786) 

<223> Glucose dehydrogenase M02-6- 



<400> 11 

atg tat aaa gat tta gaa gga aaa gta gtt gtc ata aca ggt tea tct 48 
Met Tyr Lys Asp Leu Glu Gly Lys Val Val Val lie Thr Gly Ser Ser- 
15 10 15 
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acc ggt tta gga aaa gca atg gcg att cgt ttt gcg aca gaa aaa get 
Thr Gly Leu Gly Lys Ala Met Ala He Arg Phe Ala Thr Glu Lys Ala 
20 25 30 



96 



aaa gta gtt gtg aat tat cgt t'cg aaa gaa gaa gaa get aac age gtt 144 
Lys Val Val Val Asn Tyr Arg Ser Lys Glu Glu Glu Ala Asn -Ser Val 
35 40 45 

tta gaa gaa att aaa aaa gtc ggc gga gag gca att gcg gtt aaa ggt 192 
Leu Glu Glu He Lys Lys Val Gly Gly Glu Ala He Ala Val Lys Gly 
50 55 60 . 



gac gta aca gtt gag tct gac gtg ate aat tta gtt caa tct get att 
Asp Val Thr Val Glu Ser Asp Val He Asn Leu Val Gin Ser Ala He 
65 70 75 * ao 

aaa gaa ttt gga aag tta gat -gtt atg att aat aac gca gga atg gaa 
Lys Glu Phe Gly Lys Leu Asp Val Met He Asn Asn Ala Gly Met Glu 
85 90 95 



agt gtt cat gag aaa att cct tgg cca tta ttt gtt cat tac gca gca 
Ser Val His Glu Lys lie Pro Trp Pro Leu Phe Val His Tyr Ala Ala 
145 * 150 155 ~ 160 

agt aaa ggt ggc atg aag etc atg act gaa aca ctt gca. tta gaa tat 
Ser Lys Gly Gly Met Lys Leu Met Thr Glu Thr Leu Ala Leu Glu Tyr 
165 170 175 



240 



288 



aat ccg gtt tea tct cat gaa atg tct tta age gat tgg aat aaa gta 336 
Asn Pro Val Ser Ser His Glu Met Ser Leu Ser Asp Trp Asn Lys Val 
100 105 no 

att gat acg aac tta acg gga gca ttt tta gga age cgt gaa geg att 3 8* 

He Asp Thr Asn Leu Thr Gly Ala Phe Leu Gly Ser Arg Glu Ala lie 
115 120 125 

aaa tat ttc gtg gaa aat gat att aag gga aca gtt att aat atg teg 43? 
Lys Tyr Phe Val Glu Asn Asp lie Lys Gly Thr Val lie Asn Met Ser 
130 135 140 



480 



528 



get cea aaa ggt att cgt gta aat aac att ggg ccg gga gcg att aat 576 
Ala Pro Lys Gly He Arg Val Asn Asn lie Gly Pro Gly Ala lie Asn 
180 185 " igo 

aca ccg att aac get gag aaa ttt get gat cct aag cag cgc gca gat 624 
Thr Pro lie Asn Ala Glu Lys Phe Ala Asp Pro Lys Gin Arg Ala Asp 
195 200 205 

gta gaa age atg att cca atg gga tac ate gga gag ccg gaa gaa att 672 
Val Glu Ser Met lie Pro Met Gly Tyr He Gly Glu Pro Glu Glu lie 
210 215 220 

gca gcg gtt get gca tgg eta get tct tea gaa gca agt tat gta aca 720 
Ala Ala Val Ala Ala Trp Leu Ala Ser Ser Glu Ala Ser Tyr Val Thr 
225 230" 235 240 
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999 att acg etc ttt get gac ggc ggt atg aca cag tac cca tea ttc 768 
bly He Thr Leu Phe Ala Asp Gly Gly Met Thr Gin Tyr Pro Ser Phe 



245 250 



255 



caa gca gga cgc gga taa fl 
Gin Ala Gly Arg Gly * 



260 



<210> 12 

<211> 261 

<212> PRT 

<213> Bacillus sp. 

<400> 12 

Met Tyr Lys Asp Leu Glu Gly Lys Val Val Val He Thr Gly Ser Ser 

15 1° 15 

Thr Gly Leu Gly Lys Ala Met Ala He Arg Phe Ala Thr Glu Lvs Ala 

20 25 30 " 

Lys .Val Val Val Asn Tyr Arg Ser Lys Glu Glu Glu Ala Asn Ser Val 

35 4 0 45 

Leu Glu Glu He Lys Lys Val Gly Gly Glu Ala He Ala Val Lys Gly 

50 55 60 

Asp Val Thr Val Glu Ser Asp Val He Asn Leu Val Gin Ser Ala He 
65 70 75 ao 

Lys Glu Phe Gly Lys Leu Asp Val Met He Asn Asn Ala Gly Met Glu 

35 90 95 

Asn .Pro Val Ser Ser His Glu Met Ser Leu Ser Asp Trp Asn Lys Val 

100 105 110 

He Asp Thr Asn Leu Thr Gly Ala Phe Leu Gly Ser Arg Glu Ala He 

115 120 12 5 

Lys Tyr Phe Val Glu Asn Asp lie Lys Gly Thr Val He Asn Met Ser 

130 • ■ 135 i4 0 

Ser Val His Glu Lys He Pro Trp Pro Leu Phe Val His Tvr Ala Ala 
145 • 150 155 ' " 160 

Ser Lys Gly Gly Met Lys Leu Met Thr Glu Thr Leu Ala Leu Glu Tyr 

1^5 170 175 

Ala Pro Lys Gly He Arg Val Asn Asn He Gly Pro Gly Ala He Asn 

180 185 igo 

Thr Pro He Asn Ala Glu Lys Phe Ala Asp Pro Lys Gin Arg Ala Asp 

195 200 205 

Val Glu Ser Met lie Pro Met Gly Tyr He Gly Glu Pro Glu Glu He 

210 215 220 

Ala Ala Val Ala Ala Trp Leu Ala Ser Ser Glu Ala Ser Tyr Val Thr 
225 230 235 2*0 

Gly lie Thr Leu Phe Ala Asp Gly Gly Met Thr Gin Tyr Pro Ser Phe 

• 245 250 255 

Gin Ala Gly Arg Gly 
260 

<210> 13 
<211> 765 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH.l 
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<221> CDS 

<222> (1) . . . (765) 

<400> 13 

atg age acc get ate gtc acc aac gtc aaa cat ttt ggt ggt atg ggt 48 

Met Ser Thr Ala lie Val Thr Asn Val Lys His Phe. Gly Gly Met Gly 

1 5 10 15 

age get ctg agg ctg age gaa get ggt cat acc gtc get -tgc cat gat 96 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 

20 25 30 

gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa acc tac 144 

Glu Ser Phe Lys Gin Lys Asp Glu' Leu Glu Ala Phe Ala Glu Thr Tyr 

35 40 45 

cca cag ctg aaa cca atg age gaa cag gaa cca get" gaa ctg ate gaa- 192 

Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu lie Glu 

50 55 60 

get gtc aec age get tac ggt cag gtc gat gtc ctg gtc age aac gat 240 

Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 

65 70 75 80 . 

ate ttt get cca gaa ttt cag cca ate gat aaa tac get gtc gaa gat 288 

lie Phe Ala Pro Glu Phe Gin Pro lie Asp Lys Tyr Ala Val Glu Asp 

85 '90 95 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta gtg 336 

Tyr Arg Gly Ala Val Glu Ala Leu Gin lie Arg Pro Phe Ala Leu Val 

100 - 105 110 

aat get gtg get teg caa atg aag aag cga aag teg ggg cac ate ate 3 84 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 

115 120 125 

ttc ate act teg get act ccg ttc ggg ccg tgg aag gag eta teg act 432 

Phe -He Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

130 135 140 

tac act teg get cga get ggg get tgt act eta get aat get eta teg 480 

Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 

145 150 155 160 

aag gag eta ggg gag tac aat ate ccg gtg ttc get ate ggg ccg aat 528 

Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 

165 170 175 

tac eta cac teg gag gat teg ccg tac ttc tac ccg act gag ccg tgg 576 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 

180 185 190 

aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 624 

Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 205 
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caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt ttg £72 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
21 ° 215 220 



gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 . 230 235 240 



720 



ggc ggc ttt ccc atg ata gaa cgt tgg ccc ggc atg ccc gaa taa 7 s 5 
Gly Gly Phe Pro Met He Glu Arg Trp Pro Gly Met Pro Glu * 
245 250 



<210> 14 
<211>- 254 
<212> PRT 

<213> Agrobacterium sp. 
<400> 14 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 5 10 15 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asd 

20 25 3 0 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 

35 40 ' 45 

Pro Gin Leu Lys ' Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 

50 . 55 so 

Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn A*p 
65 70 75 so 

He Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp 

85 90 * 95 

Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 

100 105 i 10 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He lie 

115 120 ' 125 

Phe lie Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

130 135 140 

Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 150 155 160 

Lys Glu Leu Gly Glu Tyr Asn lie Pro Val Phe Ala He Gly Pro Asn 

165 170 175 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 

180. 185 iQo 

Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 205 

Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

210 215 220 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 2 40 

Gly Gly Phe Pro Met lie Glu Arg Trp Pro Gly Met Pro Glu 
245 ' 250 

<210> 15 
<211> 765 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> HHDH.2 

<221> CDS 

<222> (1) . . . (765) 

<400> 15 

atg age acc get ate gtc ace aac gtc aaa cat ttt ggt ggt atg ggt 43 

Met Ser Thr Ala lie Val Thr Asn Val Lys His Phe Gly Gly Met Gly 
1 5 10 15 . 

age get ctg agg ctg age gaa get ggt cat acc gtc get tgc cat gat 96 
Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 
2 0 25 3 0 

gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa acc tac 144 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 • 40 45 

cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg ate gaa 192 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu lie Glu 
50 55 60 

get gtc acc age get tac ggt caa gtc gat gtc ctg gtc age aac gat 240 
Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 75 80 

ate ttt get cca gaa ttt cag cca ate gat aaa tac get gtc gaa gat 283 
He Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp 
85 90 95 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta gtg 336 
Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 
100 105 ~ 110 

aat get gtg get teg caa ' atg aag aag cga aag teg ggg cac ate ate 3 84 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He lie 
115 120 125 

ttc ate act teg get act ccg ttc ggg ccg tgg aag gag eta teg act 432 
Phe lie Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
130 135 140 

tac act teg get cga get ggg get tgt act eta get aat get eta teg 480 
Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 150 155 160 

aag gag eta ggg gag tac aat ate ccg gtg ttc get ate ggg ccg aat 528 
Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala lie Gly Pro Asn 
165 170 175 

tac eta cac teg gag gat teg ccg tac ttc tac ccg act gag ccg tgg 576 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro -Trp 
180 185 ' 190 

aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 624 
Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
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195 200 205 

caa cga tta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt ttg €7^ 

Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

210 215 ^20 



gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 23 ° ' 235 2 4 0 



720 



ggc ggc ttt ccc atg ata gaa cgt tgg ccc ggc atg ccc gaa taa 7£5 
Gly Gly Phe Pro Met He Glu Arg Trp Pro Gly Met Pro Glu * 
245 250 



'<210> 15 
<211> 254 
<212> PRT 

<213> Agrobacterium sp. 
<40Q> 16 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 5 10 1S 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 

20 25 - 30 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 40 45 3 

. Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu lie Glu 
50 55 go 

Ala Val Thr Ser Ala Tyr Gly Gin Val Asp" Val Leu Val Ser Asn Asp 
65 70 75 80 

He -Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp 

85 90 95 

Tyr Arg -Gly Ala Val Glu Ala Leu Gin He. Arg Pro" Phe Ala Leu Val 

100 105 110 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 

115 120- * !2S 

Phe He Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

130 135 i4Q 

Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 150 155 i 6 o 

Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 

165 170 175 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 

180 185 190 

Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

135 200 ~ 205 

Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

210 215 220 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 240 

Gly Gly Phe Pro Met lie Glu Arg Trp Pro Gly Met Pro Glu 
245 250 

<210> 17 
<211> 765 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> HHDH.16 

<221> CDS 

<222> (1) , . . (765) 

<400> 17 

atg age acc get ate gtc acc aac gtc aaa cat ttt ggt ggt atg ggt 
Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 
15 10 15 



48 



age get ctg agg ctg age gaa get ggt cat acc gtc get tgc cat gat 96 
Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 
20 25 30 

gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa acc tac 144 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 40 45 



cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg ate gaa 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 
50 55 so 



192 



get gtc acc age get tac ggt cag gtc gat gtc ctg gtc age aac gat 240 
Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 75 80 

ate ttt get cca gaa ttt cag cca ate gat aaa tac get gtc gaa gat 288 
He Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp 
85 90 " 95 

tac agg ggt get gtc gaa get ctg cag ate. agg cca ttt get eta gtg 336 
Tyr Arg Gly Ala Val Glu Ala Leu Gin lie Arg Pro Phe Ala Leu Val 
100 105. no 

aat get gtg get teg caa atg aag aag cga aag teg ggg cac ate ate 3 84 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 
115 120 125 

ttc ate act teg get act ccg ttc ggg ccg tgg aag gag eta teg act 432 
Phe He Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
130 135 140 

tac act teg get cga get ggg get tgt act eta get aat get eta teg 480 
Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 150 155 160 

aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccg aat 528 
Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 
165 170 175 

tac eta cac teg gag gat teg ccg tac ttc tac ccg act gag ccg tgg 576 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 
180 185 190 
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aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 624 
Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
195 200 205 

caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt ttg 672 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Glv Glu Leu Val Ala Phe Leu 
210 215 " 220 

gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca 720 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 " 240 

ggc ggc ttt ccc atg ata gaa cgt tgg ccc ggc atg ccc gaa taa 765 
Gly Gly Phe Pro Met He Glu Arg Trp Pro Gly Met Pro Glu * 
245 250 



<210> 18 
<211> 254 
<212> PRT 

<213> Agrobacterium sp. 
<400> 18 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 5 10; is 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 

20 25 30 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 

35 40 45 

Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 

50 55 . 60 

Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 ~ 75 80 

He Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp 

85 90 " 95 

Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu -Val 

100 105 HO 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His lie He 

115 120 " 125 

Phe He Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

130 135 * " 140 

Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 150 155 160 

Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 

165 170 " 175 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Giu Pro Trp 

130 185 ' 190 . 

Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 205 

Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

210 215 220 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 " 240 

Gly Gly Phe Pro Met lie Glu Arg Trp Pro Gly Met Pro Glu 
245 " 250 

<210> 19 



20 



WO 2005/018579 



PCTYUS2004/004819 



<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic primer 
<400> 19 

gaattcgccc atatgtatcc ggatttaaaa gg 32 

<210> 20 
<211> 34 
<212> DNA 

<213> Artificial Sequence. 
<220> 

<223> Synthetic primer 
<400> 20 

tggccggatc ctcattaacc gcggcctgcc tgga 34 

<210> 21 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic primer 
<400> 21 

gaattcgccc atatgtataa agatttagaa gg 32 

<210> 22 ■ 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic primer 
<400> 22 

ggccggatcc tcattatccg cgtcctgctt gga 3 3 

<210> 23 
<211> 7b5 
<212>'DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH P016013-B-03 

<221> CDS 

<222> (1) . . . (765) 

<400> 23 

atg age acc get ate gtc acc aac gtc aaa cat ttt gga ggt atg ggt 48 
Met Ser Thr Ala lie Val Thr Asn Val Lys His Phe Gly Gly Met Gly ■ 
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10 15 



age get ctg agg ctg age gaa get ggt cat ace gtc get tgc cat gat 
Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His A=;p 
20 2 5 * ^ 



30 



cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg ate gaa 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 
50 55 so 



80 



ate ttt get cca gaa ttt cag cca ate gat aaa tac get gtc gaa gat 
He Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp 
85 90 



95 



100 105 



110 



aat get gtg get teg caa atg aag aag cga aag teg ggg cac ate ate 
Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 
115 120 ~ 125 



tac act teg get cga get ggg get tgt act eta get aat get eta teg 
Tyr Thr Ser Ala Arg Ala Gly Ala Cys. Thr Leu Ala Asn Ala Leu Ser 
145 150 155 160 



96 



gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa ace tac 144 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 40 45 



192 



get gtc acc age get tac ggt cag gtc gat gtc ctg gtc age aac gat 2^0' 
Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn As P 
65 70 75 



288 



tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta gtg 336 
Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 



384 



ttc ate act teg get get ccg ttc ggg cca tgg aaa gag eta teg act 4 3: 

Phe He Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
130" 135 14Q 



480 



aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccg aat 528 
Lys Glu Leu Gly Glu Tyr Asn lie Pro Val Phe Ala lie Gly Pro Asn 
165 170 175 

tac eta cac teg gag gat teg ccg tac ttc tac ccg act gag ccg tgg 576 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 
180 185 190 

aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 624 
Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
135 200 205 

caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttc ttg 672 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
210 215 220 

gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca 720 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 " 2 40 
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ggc ggc ttt ccc atg ata gaa cgt tgg ccc ggc atg ccc gaa taa 765 
Gly Gly Phe Pro Met lie Glu Arg Trp Pro Gly Met Pro Glu * 
245 " ~ 250 



<210> 24 
<211> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HHDH P016013-B-03 



<400> 24 



Met Ser Thr 


Ala 


He 


Val 


Thr 


Asn Val Lys His 


Phe Gly Gly Met 


Gly 


1 




5 






10 








15 


Ser Ala Leu 


Arg 


Leu 


Ser 


Glu 


Ala Gly His Thr 


Val 


Ala 


Cys 


His 






20 








25 






30 




Glu Ser Phe 


Lys 


Gin 


Lys 


Asp 


Glu Leu Glu Ala 


Phe 


Ala 


Glu 


Thr 




35 










40 




45 






Pro Gin Leu 


Lys 


Pr,q 


Met 


Ser 


Glu Gin Gin Pt~o 


Ala 


Glu 


Leu 


He 


vaJLU 


50 








55 




60 










Ala Val Thr 


Ser 


Ala 


Tyr 


Gly 


Gin Val Asp Val 


Leu- 


Val 


Ser 


Asn 


Asp 


65 






70 




75 










80 


lie Phe Ala 


Pro 


Glu 


Phe 


Gin 


Pro He Asp Lys 


Tyr Ala Val 


Glu 


Asp 






85 






90 








95 


Tyr Arg Gly 


Ala 
100 


Val 


Glu 


Ala 


Leu Gin He Arg 
105 


Pro 


Phe 


Ala 
110 


Leu 


Val 


Asn Ala Val 


Ala 


Ser 


Gin 


Met 


Lys Lys Arg Lys 


Ser Gly His 


He 


He 


115 










120 




125 








Phe He Thr 


Ser 


Ala 


Ala 


Pro 


Phe Gly Pro Trp 


Lys 


Glu 


Leu 


Ser 


Thr 


130". 








135 




140 










Tyr Thr Ser 


Ala 


Arg 


Ala 


Gly 


Ala Cys Thr Leu 


Ala 


Asn 


Ala 


Leu 


Ser 


145 






150 




155 










160 


Lys Glu Leu 


Gly Glu 


Tyr 


Asn 


He Pro Val Phe 


Ala 


He 


Gly Pro 


Asn 






165 






170 








175 




Tyr Leu His 


.Ser 
180 


Glu 


Asp 


Ser 


Pro Tyr Phe Tyr 
185 


Pro 


Thr 


Glu 
190 


Pro 


Trp 


Lys Thr Asn 


Pro 


Glu 


His 


Val 


Ala His Val Lys 


Lys 


Val 


Thr 


Ala 


Leu 


195 










200 




205 








Gin Arg Leu 


Gly Thr 


Gin 


Lys 


Glu Leu Gly Glu 


Leu 


Val 


Ala 


Phe 


Leu 


210 








215 




220 










Ala Ser Gly 


Ser 


Cys 


Asp 


Tyr 


Leu Thr Gly Gin 


Val 


Phe 


Trp 


Leu 


Ala 


225 






230 




.235 










240 


Gly Gly Phe 


Pro 


Met 

245 


He 


Glu 


Arg Trp Pro Gly 
250 


Met 


Pro 


Glu 







<210> 25 
<211> 765 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH P016015-C-04 
<221> CDS 
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<222> (1) . . . (765) 



<400> 25 

atg age acc get ate gtc acc aac gtc aaa cat ttt gga ggt atg ggt 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 
1 5 10 is 



tac eta cac teg gag gat teg ccg tac ttc tac ceg act gag ccg tgg 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 
180 185 * 190 



43 



age get ctg agg ctg age gaa get ggt cat acc gtc get tgc cat gat 96 
Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 
20 25 30 

gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa acc tac 144 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 40 45 



192 



240 



cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg ate gaa 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 
50 55 so 

get gtc acc age get tac ggt cag gtc gat gtc ctg gtc age aac gat 
Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 75 so 

ate ttt get cca gaa ttt cag cca ate gat aaa tac get gtc cag gat 
He Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Gin Asp 
' 85 90 * ~ 95 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get ctg gcg 336 
Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Ala 
100 105 no 

aat get gtg get teg caa atg aag aag cga aag teg ggg cac ate ate 
Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His lie He 
115 120 125 



288 



334 



ttc ate act teg get get ccg ttc ggg cca tgg aag qag eta teg act 432 
Phe He Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
130 135 140 

tac act teg get cga get ggg get tgt act eta get aat get eta teg 430 
Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 150 155 160 

aag gag eta gga gag tac aat ate ccg gtg ttc get ate gqg ccg aat 528 
Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 
165 170 175 



576 



aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 624 
Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
195 200 " 205 

caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt ttg 672 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
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210 215 220 

gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 

225 230 235 240 



720 



ggc ggc ttt ccc atg ata gaa cgt tgg ccc ggc atg ccc gaa taa 765 
Gly Gly Phe Pro Met lie Glu Arg Trp Pro Gly Met Pro Glu * 
245 250 



■ <210> 26 
<211> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HHDH P016015-C-04 
<400> 26 

Met Ser Thr Ala lie Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 5 10. " is 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 

20 25 30 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 

35 40 45 

Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 

50 55 60 

Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 
* 5 70 75 so 

He Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Gin Asp 

85 90 95 

Tyr Arg- .Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Ala 

100 105 Ho 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 

115 120 * 125 

Phe He Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

130 135 140 

Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr. Leu Ala Asn Ala Leu Ser 
145 150 155 160 

Lys Glu Leu Gly Glu Tyr Asn lie Pro Val Phe Ala He Gly Pro Asn 

165 170 175 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 

180 185 190 

Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 ' 205 ■ 

Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

210 215 , 220 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 ~ 240 

Gly Gly Phe Pro Met He Glu Arg Trp Pro Gly Met Pro Glu 
245 250 

<210> 27 
<211> 765 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> HHDH P016014-E-01 

<221> CDS 

<222> (1) . . . (765) 

<400>'27 

atg age acc get ate gtc acc aac gtc aaa cat ttt gga ggt atg ggt 48 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 
1 5 10 is 

age get ctg agg ctg age gaa get ggt cat acc gtc get tgc cat gat 96 
Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 
20 25 -30 

gaa age ttt- aaa cag aaa gat gaa ctg gaa get ttt get gaa acc tac 144 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 40 45 

cca- cag ctg aaa cca atg age gaa cag gaa cca get gac ctg att gaa 192 
•Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Asp Leu He Glu 
50 55 60 

get gtc acc age get tac ggt cag gtc gat gtc ctg gtc age aac gat 240 
Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 75 80 

ate ttt get cca gaa ttt cag cca ate gat aaa tac get gtc gaa gac 288 
He Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp 
85 90 ' 95 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta gtg 336 
Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 
100 105 HO 

-aat get gtg get teg caa atg aag aaa cga aag teg ggg cac ate ate 3 84 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 
115 120 125 

ttc ate act teg get act ccg ttc ggg cca tgg aaa gag eta teg act 432 
Phe lie Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
130 135 140 

tac act teg get cga get ggg get tgt act eta get aat get eta teg 480 
Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 ' 150 155 160 

aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccg aat 528 
Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala lie Gly Pro Asn 
165 170 175 

tac eta cac teg gag gat teg ccg tac ttc tac ccg act gag ccg tgg 57 6 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 
ISO 185 190 

aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 624 
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Lys Thr Asn Pro Glu His Val Ala His Val Lys Lvs Val Thr Ala Leu 
195 200 " " 205 

caa cga eta ggg act caa aaa gag ttg ggg qa a ttg gtg gca ttc ctg 672 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
210 215 220 

gca tct ggc tct tgt gat tat ttg act gg c cag gtg ttt tgg ttg gca 720 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 240 

ggc ggc ttt ccc att ata gaa cgt tgg ccc ggc atg ccc gaa taa 765 
Gly Gly Phe Pro He He Glu Arg Trp Pro Gly Met Pro Glu * 
245 250 



<210> 28 
<211> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HHDH P016014-E-01 
<400> 28 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 5 10 15 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 

20 25 30 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tvr 

35 40 45 

Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Asp Leu He Glu 

50 55 .60 

Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 75 ao 

lie Phe Ala Pro Glu Phe "Gin Pro He Asp Lys Tyr Ala Val Glu Asp 

85 90 • 95 

Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 

100 105 no 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 

115 ' 120 " " 125 

Phe He Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

130 135 * 140 

Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 150 155 i6o 

Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 

165 170 175 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 

180 185 190 

Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 205 

Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

210 215 220 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 240 

Gly Gly Phe Pro He He Glu Arg Trp Pro Gly Met Pro Glu 
24 5 250 
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<210> 29 

<211> 768 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH P016014-G-08 

<221> CDS 

<222> (1) . . . (765) 

<400> 29 

atg age acc get att gtc acc aac gtc aaa cat ttt gga ggt atg ggt 48 

Met Ser Thr Ala He Val Thr Asn Val Lya His Phe Gly Gly Met Gly 

1 5 10 'is 

age get ctg aaa ctg age gaa get ggt cat acc gtc get tgc cat gat 96 
Ser Ala Leu Lys Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 
20 25 30 

.ggt age ttt aag cat aaa gat gaa ctg gaa get ttt get gaa gee tac 144 
Gly Ser Phe Lys His Lys Asp Glu Leu Glu Ala Phe Ala Glu Ala Tyr 
35 40 45 

cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg att gaa 192 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 
50 55 60 

get gtc acc age get ttt ggt cat gtc gat gtc ctg gtc age aac gat 240 
Ala Val Thr Ser Ala Phe Gly His Val Asp Val Leu Val Ser Asn Asp 
65 • 70 75 80 

ate ttt get eta gaa ttt egg cca ate gat aaa tac get gtc gag gat 2 88 

He Phe Ala Leu Glu Phe Arg Pro He Asp Lys Tyr Ala Val Glu Asp 
85 90 95 . 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta gtg 336 
Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 
100 105 110 

aat get gtg get cca caa atg aag aag cga aag teg ggg cac ate ate 3 84 

Asn Ala Val Ala Pro Gin Met Lys Lys Arg Lys Ser Gly His He He 
115 120 125 

ttc ate act teg get gec ccg ttc ggg cca tgg aag gag eta teg act 432 
Phe He Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
130 135 140 

tac tct teg get cga get ggg get agt gca eta get aat get eta teg 480 
Tyr Ser Ser Ala Arg Ala Gly Ala Ser Ala Leu Ala Asn Ala Leu Ser 
145 150 155 160 

aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccg aat 528 
Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 
165 170 175 
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tac eta cac teg gag gat teg ccg ttc tat tac ccc act gag ccg tqg ^75 
Tyr Leu His Ser Glu Asp Ser Pro Phe Tyr Tyr Pro Thr Glu Pro Trp 
180 185 ^ 190 

aag att aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 6^4 
Lys He Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
195 200 " 205 

caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt ttg 67? 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
210 215 220 



gca tct ggc tct tgt. gat tat ttg act ggc cag gtg ttt tgg ttg gca 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 240 



tga 

<210> 30 
<211» 254 
<212> PRT 

<213> Artificial Sequence 
<220* 

<223> HHDH P016014-G-08 
<400> 30 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 5 10 "l5 

Ser Ala Leu Lys Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 

20 25 30 

Gly Ser Pjie Lys His Lys 'Asp Glu Leu Glu Ala Phe Ala Glu Ala Tyr 

35 40 ( 45 

Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu lie Glu 

50. 55 60 

Ala Val Thr Ser Ala Phe Gly His Val Asp Val Leu Val Ser Asn Asp 
65 70 75 80 

lie Phe Ala Leu Glu Phe- Arg Pro lie Asp Lys Tyr Ala Val Glu Asp 

85 90 95 

Tyr Arg Gly Ala Val Glu Ala Leu Gin lie Arg Pro Phe Ala Leu Val 

100 105 no 

Asn Ala Val Ala Pro Gin Met Lys Lys Arg Lys Ser Gly His He He 

H5 120 " 125 

Phe He Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

130 135 140 

Tyr Ser Ser Ala Arg Ala Gly Ala Ser Ala Leu Ala Asn Ala Leu Ser 
145 150 155 160 

Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 

165 170 175 

Tyr Leu His Ser Glu Asp Ser Pro Phe Tyr Tyr Pro Thr Glu Pro Trp 

180 185 " ' 190 

Lys He Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
195 200 205 



720 



ggc ggc ttt ccc get ata gaa cgt tgg- ccc ggc atg ccc gaa taa 765 
Gly Gly Phe Pro Ala lie Glu Arg Trp Pro Gly Met Pro Glu * 
"245 250 



768 
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Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

210 215 220 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 240 

Gly Gly Phe Pro Ala lie Glu Arg Trp Pro Gly Met Pro Glu 
24 5 250 

<210>" 31 
<211> 765 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH Mcl/2G5 

<221> CDS 

<222> (1) . . . (765) 

<400> 31 

atg age acc get ate gtc ace aac gtc aaa cat ttt gga ggt atg ggt 48 
Met Ser Thr Ala lie Val Thr Asn Val Lys His. Phe Gly Gly Met Gly 
15 10 " 15 

age get ctg agg ctg age gaa get ggt cat acc gtc get tgc cat gat 96 
Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 
20 25 30 

gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa acc tac 144 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 40 45 

cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg ate gaa 192 
Pro Gin 'Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu lie Glu 
50 55 60 

get gtc acc age get tac ggt cag gtc gat gtc ctg gtc age aacgat 240 
Ala Val Thr Ser Ala Tyr Gly Gin Val. Asp Val Leu Val Ser Asn Asp 
65 70 75 80 

ate ttt get cca gaa ttt cag cca ate gat aaa tac get gtc gaa gat 288 
lie Phe Ala Pro Glu Phe Gin Pro lie Asp Lys Tyr Ala Val Glu Asp 
85 90 95 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta gtg 3 36 

Tyr Arg Gly Ala Val Glu Ala Leu Gin lie Arg Pro Phe Ala Leu Val 
100 105 no 



aat get gtg get teg caa atg aag aag cga aag teg ggg cac ate ate 3 84 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 
115 120 125 

ttc ate act teg get act ccg ttc ggg cca tgg aag gag eta teg act 432 
Phe He Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
13 0 13 5 * 14 0 

tac act teg get cga get ggg get tgt act eta get aat. get eta teg 480 
Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
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145 150 155 



160 



aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccg aat 523 
Lys Glu Leu Gly Glu Tyr Asri- lie Pro Val Phe Ala lie Gly Pro Asn 
165 170 175 

tac eta cac teg gag gat teg ccg tac ttc tac ccg act gag ccg tgg 5 76 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 
180 185 190 

aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 624 
Lys Thr "Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
195 200 . 2 05 

caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt ttg 672 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
210 215 220 

gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca ' 720 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 240 

ggc ggc ttt cec atg ata gaa cgt tgg ccc ggc atg ccc gaa taa 7 65 

Gly Gly Phe Pro Met lie Glu Arg Trp Pro Gly Met Pro Glu * 
245 ~ 250 



<210> 32 
<211> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HHDH Mzl/2G5 
<400> 32 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

15 10 15 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 

20 25 30 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 

35 40 45 

Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 

50 55 60 

Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 ' 75 80 

He Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp 

85 90 95 

Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 

100 105 110 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 

115 120 125 

Phe He Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

130 135 140 

Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 150 155 160 

Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 
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165 170 175 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu P ro Tro 
180 185 190 irp 

Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 205 

Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

210 215 220 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe- Trp Leu Ala 
, 230 235 ^ o 4 Q- 

Gly Gly Phe Pro Met lie Glu Arg Trp Pro Gly Met Pro Glu 
245 250 

<210> 33 
<211> 765 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH Mzl . 1A5 

<221> CDS 
<222> (1) . . . (765) 

<400> 33 

atg age ccc get ate gtc act aac gtc aaa cat ttt ggt got atq cat 
Met Ser Pro Ala lie Val Thr Asn Val Lys His Phe Gly Gly Met Gly ' 
1 5 10 15 

acc get ctg agg ctg age gaa get ggt caa acc gtc get tgc cat gat 
Thr Ala Leu Arg Leu Ser Glu Ala Gly Gin Thr Val Ala Cys His A*p 
20 25 30 

gaa age .ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa acc tac 144 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 40 45 

cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg ate gaa 1*2 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 
50 55 ' 60 

get gtc acc age get tac ggt cag gtc gat gtc ctg gtc age aac gat 

Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 

65 70 ~ c * 

/u 75 go 

ate ttt get cca gaa ttt cag cca ate gat aaa tac get gtc gaa gat 
He Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp 
85 go 95 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta atq 
Tyr Arg Gly Ala Val Glu Ala Leu Gin lie Arg Pro Phe Ala Leu Val 
100 105 110 

aat get gtg get teg caa atg aag aag cga aag teg ggg C ac ate ate 
Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 
115 120 125 

ttc ate act teg get -act ccg ttc ggg ccg tgg aag gag eta teg act 432 



48 



96 



240 



288 



136 



384 



32 



WO 2005/018579 



PCT/US2004/004819 



Phe lie Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
130 135 140 

tac act teg get cga get ggg get tgt act eta get aat get eta teg 480 
Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 150 155 160 

aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccg aat 52 8 

Lys Glu Leu Gly Glu Tyr Asn lie Pro Val Phe Ala lie Gly Pro Asn 
165 170 175 

tac eta cac teg gag gat teg ccg tac ttc tac ccg act gag ccg tgg 575 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 
180 185 " 190 

aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 624 
Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
195 200 " ~ 205 

caa cga. eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt ttg 672' 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
210 215 220 

gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca. 720 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 ^ 240 

ggc ggc ttt ccc atg ata gaa cgt tgg ccc ggc atg ccc gaa taa 765 
Gly Gly Phe Pro Met lie Glu Arg Trp Pro Gly Met Pro Glu * 
245 250 



<210> 34- 
<211> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HHDH Msl.lAS 
<400> 34 

Met Ser Pro Ala lie Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 5 10 15 

Thr Ala Leu Arg Leu Ser Glu Ala Gly Gin Thr Val Ala Cys His Asp 

20 25 30 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 

35 40 45 

Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu lie Glu 

50 55 60 

Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 75 80 

lie Phe Ala Pro Glu Phe Gin Pro lie Asp Lys Tyr Ala Val Glu Asp 

85 90 95 

Tyr Arg-Gly Ala Val Glu Ala Leu Gin lie Arg Pro Phe Ala Leu Val 

100 105 110 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His lie He 
115 120 ~ 125 
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Phe lie 


Thr 


Ser Ala 


Thr 


Pro Phe 


Gly Pro 


Trp 


Lys 


Glu 


Leu 


Ser 


Thr 


130 








135 






140 










Tyr Thr 


Ser 


Ala Arg 


Ala 


Gly Ala 


Cys Thr 


Leu 


Ala 


Asn 


Ala 


Leu 


Ser 


145 






150 






155 










160 


Lys Glu 


Leu 


Gly Glu 


Tyr 


Asn lie 


Pro Val 


Phe 


Ala 


He 


Gly 


Pro 


Asn 






165 






170 










175 




Tyr Leu 


His 


Ser Glu 


Asp 


Ser Pro 


Tyr Phe 


Tyr 


Pro 


Thr 


Glu 


Pro 


Trp 






180 






185 








190 






Lys Thr 


Asn 


Pro Glu 


His 


Val Ala 


His Val 


Lys 


Lys 


Val 


Thr 


Ala 


Leu 




195 






200 








205 








Gin. Arg 


Leu 


Gly Thr 


Gin 


Lys Glu 


Leu Gly 


Glu 


Leu 


Val 


Ala 


Phe 


Leu 


210 








215 






220 










Ala Ser 


Gly 


Ser Cys 


Asp 


Tyr Leu 


Thr Gly 


Gin 


Val 


Phe 


Trp 


Leu 


Ala 


n c 






230 






235 










240 


Gly Gly 


Phe 


Pro Met 


lie 


Glu Arg 


Trp Pro 


Gly Met 


Pro 


Glu 










245 






250 















<210> 35 
<211> 755 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH cysl.10 

<221> CDS 

<222> (1) ... (765) 

<400> 35 

atg age acc get ate gtc acc aac gtc aaa cat ttt gga ggt atg ggt 48 
Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 
15 10 ' 15 

age get ctg agg ctg age gaa get ggt cat acc gtc get tgc cat gat 96 
Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 
20 25 30 

gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa acc tac 144 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 40 45 

cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg ate gaa 192 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 
50 55 60 

get gtc acc age get tac ggt cag gtc gat gtc ctg gtc age aac gat 240 
Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 
'65 70 , 75 80 

ate ttt get cca gaa ttt cag cca ate gat aaa tac get gtc gaa gat 288 
He Phe Ala Pro Glu Phe Gin Pro lie Asp Lys Tyr Ala Val Glu Asp 
85 90 95 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta gtg 336 
Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Fhe Ala Leu Val 
100 105 no 
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aat get gtg get teg caa atg aag aag cga aag teg ggg cac ate ate 
Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His lie lie 
115 120 ' 125 



384 



ttc ate act teg get act ccg ttc ggg cca tgg aag gag eta teg act 432 
Phe lie Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
130 135 140 

tac act teg get cga get ggg get agt act eta get aat get eta teg 430 
Tyr Thr Ser Ala Arg Ala Gly Ala Ser Thr Leu Ala Asn -Ala Leu Ser 
145 150 155 iso 

aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccg aat 528 
Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 
165 170 175 

tac eta cac teg gag gat teg ccg tac ttc tac ccg act gag ccg tgg 576 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 
ISO ' 185 " 190 

aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 624 
Lys Thr Asn Pro Glu- His Val Ala His Val Lys Lys Val Thr Ala Leu ' 
195 200 205 

caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt ttg 672 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
210 215 220 

gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca 720 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 ~ 240 

ggc ggc ttt ccc atg ata gaa cgt tgg ccc ggc atg ccc gaa taa 765 
Gly Gly Phe Pro Met He Glu Arg Trp Pro Gly Met Pro Glu * 
245 250 



<210> 36 
<211> 254 
<212> PRT 

<213> Artificial Sequence. 
<220> 

<223> HHDH cysl.10 
<400> 36 

Met Ser Thr Ala He Val Thr Asn 

1 5 
Ser Ala Leu Arg Leu Ser Glu Ala 
20 

Glu Ser Phe Lys Gin Lys Asp Glu 

• 35 " ~ 40 

Pro Gin Leu Lys Pro Met Ser Glu 

50 55 
Ala Val Thr Ser Ala Tyr Gly Gin 
65 70 
He Phe Ala Pro Glu Phe Gin Pro 



Val Lys His Phe Gly Gly Met Gly 

10 15 
Gly His Thr Val Ala Cys His Asp 
25 30 
Leu Glu Ala Phe Ala Glu Thr Tyr 
45 

Gin Glu Pro Ala Glu Leu He Glu 
60 

Val Asp Val Leu Val Ser Asn Asp 
.75 80 " 

He Asp Lys Tyr Ala Val Glu Asp 
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85 






90 


95 




xyr 


Arg 




Ala Val 


Glu 


Ala Leu 


Gin He Arg Pro Phe Ala 


Leu 


Val 








100 






105 no 




Asn 


Ala 


Val 


Ala Ser 


Gin 


Met Lys 


Lys Arg Lys Ser Gly His 


He 


lie 






115 






120 


125 




Fne 


lie 


Thr 


Ser Ala 


Thr 


Pro Phe 


Gly Pro Trp Lys Glu Leu Ser 


Thr 




130 








135 


140 




Tyr 


Thr 


Ser 


Ala Arg 


Ala Gly Ala 


Ser Thr Leu Ala Asn- Ala 


Leu 


Ser 


145 








150 




155 




ISO 


Lys 


Glu 


Leu 


Gly Glu 


Tyr 


Asn He 


Pro Val Phe Ala He Gly Pro Asn 






His 


165 






170 


175 




Tyr 


Leu 


Ser Glu 


Asp 


Ser Pro 


Tyr Phe Tyr Pro Thr Glu 


Pro 


Trp 








130 






185 190 




Lys 


Thr 


Asn 


Pro Glu 


His 


Val Ala 


His Val Lys Lys Val Thr 


Ala 


Leu 


Gin 




195 






200 


205 






Arg 


Leu 


Gly Thr 


Gin 


Lys Glu 


Leu Gly Glu Leu Val Ala 


Phe 


Leu 




210 








215 


220 




Ala 


Ser 


Gly 


Ser Cys 


Asp Tyr Leu Thr Gly Gin Val Phe Trp 


Leu 


Ala 


225 








230 




235 




240 


Gly 


Gly" 


Phe 


Pro Met 


He 


Glu Arg Trp Pro Gly Met Pro Glu 










245 






■ 250 







<210> 37 
<211> 765 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH cys 2 .12 

<221> CDS 
<222> {!)... (765) 

<400> 37 

atg age acc get ate 

Met Ser Thr Ala He 

1 5 

age get ctg agg ctg 
Ser Ala Leu Arg Leu 
20 

gaa age ttt aaa cag 
Glu Ser Phe Lys Gin 
35 

cca cag ctg aaa cca 
Pro Gin Leu Lys Pro 
50 

get gtc acc age get 
Ala Val Thr Ser Ala 
65 

ate ttt get cca gaa 
He Phe Ala Pro Glu 
85 



36 



gtc acc aac gtc aaa cat ttt gga ggt atg ggt 
Val Thr Asn Val Lys His Phe Gly Gly Met Gly 
10 15 



43 



age gaa get ggt cat acc gtc get gcg cat gat 96 
Ser Glu Ala Gly His Thr Val Ala Ala His Asp 
25 30 

aaa gat gaa ctg gaa get ttt get gaa acc tac 144 
Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
40 45 

atg age gaa cag gaa cca get gaa ctg ate gaa 192 
Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 
55 so 

tac ggt cag gtc gat gtc ctg gtc age aac gat 2*0 
Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asd 
70 75 so 

ttt cag cca ate gat aaa tac get gtc gaa gat 288 
Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp 
90 95 
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tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta gtg 336 

Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 
100 105 110 

aat get gtg get teg caa atg aag aag cga aag teg ggg cac ate ate 3 84 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He , 

115 120 125 

ttc ate act teg get act ccg ttc ggg cca tgg aag gag eta teg act 432 

Phe He Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

130 135 140 

tac act teg get cga get ggg get tgt act eta get aat get eta teg 480 

Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 

145 150 155 160 

aag gag eta gga gag tac aat' ate ccg gtg ttc get ate ggg ccg aat 528 

Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala lie Gly Pro Asn 
165 170 175 

tac eta cac teg gag. gat teg ccg tac ttc tac ccg act gag ccg tgg 576 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 
130 185 190 

aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 624 

Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 205 

caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt ttg 672 

Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

210 * 215 220 



gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 ' A ^ 230 235 240 



720 



ggc ggc ttt ccc atg ata gaa cgt tgg ceo ggc atg ccc gaa taa 765 
Gly Gly Phe Pro Met He Glu Arg Trp Pro Gly Met Pro Glu * 
245 250 



<210> 38 
<211> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HHDH cys2 . 12 
<4Q0> 38 



Met Ser Thr Ala lie Val Thr 


Asn Val Lys 


His 


Phe Gly 


Gly 


Met 


Gly 


1 5 


10 








15 




Ser Ala Leu Arg Leu Ser Glu 


Ala Gly His 


Thr 


Val Ala 


Ala 


His 


Asp 


20 


25 






30 






Glu Ser Phe Lys Gin Lys Asp 


Glu Leu Glu 


Ala 


Phe Ala 


Glu 


Thr 


Tyr 



35 40 45 
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Pro 


Gin 


Leu 


uys fro 


Ma t- 


Ser 


Glu 


vain 


Glu 


Pro 


Ala Glu Leu 


He 


Glu 




50 








5 5 










60 






Ala 


Val 


xnr 


Ser Al a 


Tyr 




Gin 


Val 


Asp Val 


Leu Val Ser 


Asn Asp 


65 








70 










75 






80 


lie 


Phe 


ai a 


Pro Glu 


pne 


Gin 


Pro 


lie 


Asp 


Lys 


Tyr Ala Val 


Glu Asp 








85 










90 






95 




Tyr Arg 


(jly 


Ala val 


<jlU 


Ala 


Leu 


Gin 


He 


Arg 


Pro Phe Ala 


Leu Val 








100 








105 






110 






Asn 


Ala 


val 


Ala Ser 


Gin 


Met 


Lys 


Lys 


Arg 


Lys 


Ser Gly His 


lie 


He 






115 








120 








125 






Phe 


He 


Thr 


Ser Ala 


Thr 


Pro 


Phe 


Gly 


Pro 


Trp 


Lys Glu Leu 


Ser 


Thr 




130 








135 










140 






Tyr 


Thr 


Ser 


Ala Arg 


Ala 


Gly 


Ala Cys 


Thr 


Leu 


Ala Asn Ala 


Leu 


Ser 


145 








150 










155 






160 


Lys 


Glu 


Leu 


Gly Glu 


Tyr 


Asn 


He 


Pro 


Val 


Phe 


Ala He Gly 


Pro 


Asn 








165 










170 




175 




Tyr 


Leu 


His 


Ser Glu 


Asp 


Ser 


Pro 


Tyr 


Phe 


Tyr 


Pro Thr Glu 


Pro 


Trp 








130 








185 






190 






Lys 


Thr 


Asn 


Pro Glu 


His 


Val 


Ala 


His 


Val 


Lys 


Lys Val Thr 


Ala 


Leu 






195 








200 








205 






Gin 


Arg 


Leu 


Gly Thr 


Gin 


Lys 


Glu 


Leu Gly Glu Leu Val Ala 


Phe 


Leu 




210 








215 










220 






Ala 


Ser 


Gly 


Ser Cys 


Asp 


Tyr 


Leu Thr Gly Gin Val Phe Trp 


Leu 


Ala 


225 








230 










235 






240 


Gly Gly 


Phe 


Pro Met 


He 


Glu 


Arg 


Trp 


Pro 


Gly Met Pro Glu 












245 










250 











<210> 39 
<211> 855 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> KRED krh!33c 

<221> CDS 

<222> (1) . . . (852) 

<400> 39 

atg get aaa aac ttt age aat gtc gaa tat cct gec ccg ccg cca get 43 
Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala 
15 10 15 

cat acc aaa aac gaa tea ctg cag gta ctg gat ctg ttc aaa ctg aac 96 
His Thr Lys Asn Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 
20 25 ~ 30 

ggc aaa gtc gcg tct ate acc ggt age age tea ggc att ggt tac gcg 144 
Gly Lys Val Ala Ser lie Thr Gly Ser Ser Ser Gly lie Gly Tyr Ala 
3 5 40 4 5 

ccg gec gaa get ttt gcg cag gtt ggc gca gac gtt gcg ate tgg tat 192 
Leu Ala Glu Ala She Ala Gin Val Gly Ala Asp Val Ala lie Trp Tyr 
50 55 60 

aac age cag gat gec acc ggt aaa gca gag gec ctg get aaa aaa tat 240 
Asn Ser Gin Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 
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65 70 75 80 

ggc gta aaa gtc aag get tat aaa get aat gtc age teg agt gat gcg ' 288 

Gly Val Lys Val Lys Ala Tyr. Lys Ala Asn Val Ser Ser Ser Asp Ala 
85 90 95 

gtg aaa cag act att gag cag cag ate aag gat ttt ggc cac ctg gac 336 
Val Lys Gin Thr He Glu Gin Gin He Lys Asp Phe Gly His Leu Asp 
100 105 no 

att gtt gtg gcg aac gea ggc ate cca tgg act aag ggt gca tac ate 3 84 

He Val Val Ala Asn Ala Gly He Pro Trp Thr Lys Gly Ala Tyr He 
115 120 * 125 

gat cag gat gac gat aaa cat ttt gac cag gtg gtg gac gtc gac ctg 43 2 

Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Asp Leu 
130 135 140 



aaa ggc gta ggc tat gta gca aaa cat gcg ggt cgc cat tat cgt gaa 
Lys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Tyr Arg Glu 
145 150 155 160 



480 



cgt ttc gaa aaa gaa ggc aaa aag ggc gee ttg gtt ttt' aeg get tec 528 
Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 
165 170 175 

atg teg ggt cac ate gtt aac gtg ccg caa ttt cag gcg ace tac aat 576 
Met Ser Gly His He Val Asn Val Pro Gin Phe Gin Ala Thr Tyr Asn 
180 185 190 

gcg gee aag gca ggc gtg cgt cat ttc gca aag tec ctg gee gtg gaa 624 
Ala Ala Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 
'195 200 205 

•ttt get cct ttc gca cgt gtt aac tct gta tct cct ggc tat att aat 672 
Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tvr He Asn 
210 215 220 

acc gag ate tct gat ttc gtc ccg caa gaa aca caa aat aaa tgg tgg 720 
Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 
225 230 235 "* 240 

age tta gtt cca ttg ggt cgt ggt ggg gaa act gcg gaa tta gtt ggt 763 
Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 
245 250 255 

gec tac ctg ttc ctg gca agt gat gcg ggc tec tac gee acg ggc aca 816 
Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 
260 265 270 

gat ate att gtg gat ggc ggc tac acg ctg ccg taa tga 855 
Asp He He Val Asp Gly Gly Tyr Thr Leu Pro * 
275 280 



<210> 40 
<211> 283 



39 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> BCRED krhl33c 
<400> 40 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala 

1 5 io is 

His Thr Lys Asn Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 

20 25 " 3 0 

Gly Lys Val Ala Ser He Thr Gly Ser Ser Ser Gly He Gly Tyr Ala 

35 40 45 

Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala He Trp Tyr 

50 55 go 

Asn Ser Gin Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 
65 70 75 " 80 

Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 

35 90 95 ■ 

Val Lys Gin Thr lie Glu Gin Gin He Lys Asp Phe Gly His Leu Asp 

100 105 . " no 

lie Val Val Ala Asn Ala Gly He Pro Trp Thr Lys Gly Ala Tyr lie 

US 120- 125 

Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Asp Leu 

.130 135 140 

Lys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Tyr Arg Glu " 
145 150 155 ~ iso 

Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 

165 170 175 

Met Ser Gly His lie Val Asn Val Pro Gin 'Phe Gin Ala Thr Tyr Asn 

180 185 190 

Ala Ala Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 

•195 200 * 205 

Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr He Asn 

•210 215 220 

Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 
225 230 235 240 

Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 

245 250 255 

Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 

260 265 270 

Asp lie lie Val Asp Gly Gly Tyr Thr Leu Pro 
275 280 

<210> 41 
<211> 855 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> KRED krh215 

<221> CDS 

<222> (1) . . , (855) 

<400> 41 

atg get aaa aac ttt age aat gtc gaa tat cct gec ccg ccg cca get 4 8 



40 
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Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro' Pro Pro Ala 
15 10 15 

cat acc aaa aac gaa tea ctg cag gta ctg gat ctg ttc aaa ctg aac 96 
His Thr Lys Asn Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 
20 25 3 0 

ggc aaa gtc gcg tct ate acc ggt age age tea ggc att ggt tac gcg 144 
Gly Lys Val Ala Ser lie Thr Gly Ser Ser Ser Gly He Gly Tyr Ala 
35 40 45 . 

ctg gee gaa get ttt gcg cag gtt ggc gca gac gtt gcg ate tgg tat 192 
Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala He Trp Tyr 
50 55 60 

aac age cag gat gec acc ggt aaa gca gag gee ctg get aaa aaa tat 24 0 

Asn Ser Gin Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 
65 70 75 ~ 80 

ggc gta aaa gtc aag get tat aaa get aat gtc age teg agt gat gcg 238 
Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 
85 90 95 

gtg aaa cag act att gag cag cag ate aag gat ttt ggc cac ctg gac 336 
Val Lys Gin Thr He Glu Gin Gin He Lys Asp- Phe Gly His Leu Asp 
100 105 no 

att gtt gtg gcg aac gca ggc ate cca tgg act aag ggt gca tac ate 384 
He Val Val Ala Asn Ala Gly He Pro Trp Thr Lys Gly Ala Tyr He 
115 120 * 125 

gat cag gat gac gat aaa cat ttt gac cag gtg att gac gtc gac ctg 432 
Asp -Gin Asp Asp Asp Lys His Phe Asp Gin Val He Asp Val Asp Leu 
130 135 140 

aaa ggc gta ggc tat gta gca aaa cat gcg ggt cgc cat tat cgt gaa 4 80 

Lys Gly Val Gly Tyr Val Ala Lys His' Ala Gly Arg His Tyr Arg Glu 
145 150 155' * 160 ' 

cgt ttc gaa aaa gaa ggc ata aag ggc gee ttg att ttt acg get tec 528 
Arg Phe Glu Lys Glu Gly He Lys Gly Ala Leu He Phe Thr Ala Ser 
165 170 175 

gtg teg ggt cac ate gtt aac att ccg caa ttt cag gcg acc tac aat 576 
Val Ser Gly His He Val Asn He Pro Gin Phe Gin Ala Thr Tyr Asn 
130 185 190 

gcg gec aag gca ggc gtg cgt cat ttc gca aag tec ctg gec gtg gaa 624 
Ala Ala Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 
195 200. 20S 

ttt get cct ttc gca cgt gtt aac tct gta tct cct ggc tat att aat 672 
Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr lie Asn 
210 215 220 



acc gag ate tct gat ttc gtc ccg caa gaa aca caa aat aaa tgg tgg 
Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 



720 
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225 230 



235 240 



age tta gtt cca ttg ggt cgt ggt ggg gaa act gcg gaa tta gtt aat 
Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr 111 Slu Leu val It 
245 250 2S5 y 



768 



gec tac ctg ttc ctg gca agt gat gcg ggc tec tac gec acg ggc aca 816 
Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser T yr Ala- Thr lly J£ 
260 265 270 

gat ate att gtg gat ggc ggc tac acg ctg ccg taa tga 
Asp lie He Val Asp Gly Gly Tyr Thr Leu Pro * * 
275 280 



855 



<210> 42 
<211> 283 
<212> PRT 

<213> Artificial Sequence 
<220s 

■<223> KRED krh215 
<400> 42 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala 

15 10 15 

His Thr Lys Asn Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 

20 25 30 

Gly Lys Val Ala Ser lie Thr Gly Ser Ser Ser Gly He Gly Tyr Ala 

35 40 45 

Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala lie Trp Tyr 

50 55 60 

Asn Ser -Gin Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys T Vr 

70 75 so 

Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 

Val Lys Gin Thr lie Glu Gin Gin lie Lys Asp Phe Gly His Leu Asp 

100 105 110 

He Val Val Ala Asn Ala Gly He Pro Trp Thr Lys Gly Ala Tyr He 

115 120 12 5 

Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val He Asp Val Asn Leu 
• 130 135 140 

. hys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Tyr Arg Glu 

150 155 160 

Arg Phe Glu Lys Glu Gly He L ys Gly Ala Leu He Phe Thr Ala Ser 

165 170 175 

Val Ser Gly His He Val Asn He Pro Gin Phe Gin Ala Thr Tyr Asn 

iso ias 190 

Ala Ala Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 

195 200 205 

Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr He Asn 

210 215 220 

Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 
2 ' 5 230 235 2 4o 

Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 

245 250 255 

Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 
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260 265 270 

Asp lie lie Val Asp Gly Gly Tyr Thr Leu Pro 
275 " 280 

<210> 43 
<211> 855 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> KRED krh267 

<221> CDS 

<222> (1) . . . (855) 

<400> 43 

atg get aaa aac ttt age aat gtc gaa tat cct gee ccg ccg cca get 4 8 

Met Ala .Lys Asn Phe Ser Asn 'Val Glu Tyr Pro Ala Pro Pro Pro Ala 
15 10 15 

cat acc aaa aac gaa tea ctg cag gta ctg gat ctg ttc aaa ctg aac 96 
His Thr Lys Asn Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 
20 25 30 



ggc aaa gtc geg tct ate acc ggt age age tea ggc att ggt tac geg 144 
Gly Lys Val Ala Ser He Thr Gly Ser Ser Ser Gly He Gly Tyr Ala 
35 40 * 45 

ctg gec gaa get ttt geg cag gtt ggc gca gac gtt geg ate tgg tat 192 
Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala He Trp Tyr 
50 55 " 60 

aac age "eag gat gec acc ggt aaa gca gag gee ctg get aaa aaa tat 24 0 

Asn Ser Gin Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 
65 70 75 " 80 

ggc gta aaa gtc aag get tat aaa get aat gtc age teg agt gat geg 28 8 

Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 
85 90 95 

gtg aaa cag act att gag cag cag ate aag gat ttt ggc cac ctg gac 336 
Val Lys Gin Thr He Glu Gin Gin He Lys Asp Phe Gly His Leu Asp 
100 . 105 ' 110 

att gtt gtg geg aac gca ggc ate cca tgg act aag ggt gca tac ate 384 
lie Val Val Ala Asn Ala Gly He Pro Trp Thr Lys Gly Ala Tyr lie 
115 120 125 

gat cag gat gac gat aag cat ttt' gac cag gtg att gac gtc gac ctg 432 
Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val He Asp Val Asp Leu 
130 135 140 

aaa ggc gta ggc tat gta gca aaa cat geg ggt cgc cat ctt cgt gaa 480 
Lys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Leu Arg. Glu 
145 150 155 160 

cgt ttc gaa aaa gaa ggc aaa aag ggc gee ttg gtt ttt acg get tec 528 
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Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 
1^5 170 175 

acg teg ggt cac ate gtt aac att ccg caa ttt cag gcg acc tac aat 
Thr Ser Gly His He Val Asn He Pro Gin Phe Gin Ala Thr Tyr Asn 
180 185 190 



195 200 



205 



225 230 235 



240 



gec tac ctg ttc ctg gca agt gat gcg ggc tec tac gec acg ggc aca 
Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 
260 265 ' 270 



576 



gcg gec aag gca ggc gtg cgt cat ttc gca aag tec ctg gec gtg gaa 624 
Ala Ala Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 



ttt get cct ttc gca cgt gtt aac tct gta tct cct ggc tat att aat 672 
Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr He Asn 
21 ° 215 . 22 0 

acc gag ate tct gat ttc gtc ccg caa gaa aca caa aat aaa tgg tgg 
Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 



720 



age tta gtt cca ttg ggt cgt ggt ggg gaa act gcg gaa tta .gtt ggt 768 
Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 
245 250 255 



816 



gat ate att gtg gat ggc ggc tac acg ctg ccg taa tga • 355 
Asp He He Val Asp Gly Gly Tyr Thr Leu Pro * * 
275 280 



<210> 44' 
<211> 283 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> KRED krh267 



<400> 44 



Met 


Ala 


Lys 


Asn 


Phe 


Ser Asn Val 


1 








5 




His 


Thr 


Lys 


Asn 


Glu 


Ser Leu Gin 








20 






Gly. Lys 


Val 


Ala 


Ser 


lie Thr Gly 






35 






40 


Leu 


Ala 


Glu 


Ala 


Phe 


Ala Gin Val 




50 








55 


Asn 


Ser 


Gin 


Asp 


Ala 


Thr Gly Lys 


65 










70 


Gly Val 


Lys 


Val 


Lys 


Ala Tyr Lys 










85 




Val 


Lys 


Gin 


Thr 


He 


Glu Gin Gin 








100 






He 


Val 


Val 


Ala 


Asn 


Ala Gly He 






115 






120 



Glu Tyr Pro Ala Pro Pro Pro Ala 

10 15 
Val Leu Asp Leu Phe Lys Leu Asn 
25 30 
Ser Ser Ser Gly He Gly Tyr Ala 
45 

Gly Ala Asp Val Ala He Trp Tyr 
60 

Ala Glu Ala Leu Ala Lys Lys Tyr 

75 * 80 

Ala Asn Val Ser Ser Ser Asp Ala 

90 95 
lie Lys Asp Phe Gly His Leu Asu 
105 no 
Pro Trp Thr Lys Gly Ala Tyr He 
125 
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Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val lie Asp Val Asp Leu 
130 135 140 

Lys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Leu Arg Glu 
145 150 155 iso 

Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 

165 170 175 

Thr Ser Gly His He Val Asn He Pro Gin Phe Gin Ala Thr Tyr Asn 

180 185 i9o 

Ala Ala Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 

195 200 ' 205 . 

Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr He Asn 

210 215 220 

Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 
225 230 235 240 

Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 

245 250 255 

Ala Tyr Leu Phe.- Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 

260 265 270 

Asp He He Val Asp .Gly Gly Tyr Thr Leu Pro 
275 280 

<210> 45 
<211> 855 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> KRED krh2 8 7 

<221> CDS 

<222> (1) . . . (855) 

<400> 45- 

atg get aaa aac ttt age aat gtc gaa tac cct gec ccg ccg cca get 48 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala 

1 5 10 is 

cat acc aaa aac gaa tea ctg cag gta ctg gat ctg ttc. aaa ctg aac 96 
His Thr Lys Asn Glu Ser Leu .Gin Val Leu Asp Leu Phe Lys Leu Asn 
20 25 30 

ggc aaa gtc gcg tct ate acc ggt age aac tea ggc att ggt tac gcg 144 
Gly Lys Val Ala Ser He Thr Gly Ser Asn Ser Gly He Gly Tyr Ala 
35 40 45 

ctg gee gaa get ttt gcg cag gtt ggc gca gac gtt gcg ate tgg tat 192 
Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala He Trp Tyr 
50 55 60 

aac age cat gat gec acc ggt aaa gca gag gee ctg get aaa aaa tat 240 
Asn Ser His Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 
65 70 75 ~ " 80 

ggc gta aaa gtc aag get tat aaa get aat gtc age teg agt gat gcg 288 
Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 
85 90 95 
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?2 s a s si is a a x 5 - - - s s s 

100 1Q 5 110 ' 

att gtt gtg gcg aac gca ggc ate cca tgg act aag got qca tzr .hr 

He Val Val Ala Asn Ala Gly He Pro Tr? Thr Lys S!y Sa ^r S 

120 125 

gat cag gat gac gat aaa cat ttt gac cag qta ata aac oho .. 
Asp Gin Asp Asp Asp Lys His Phe Isp G1 * ?2 Sp VaJ ^ Leu 



140 



aaa ggc gta ggc tat gta gca aaa cat gcg qqt cac cat tfeh nrr*- ~ a 
Lys Gly Val Gly Tyr V al Ala Lys His Id Ty Arg His Pne S ^ 

cgt ttc gaa aaa gaa ggc aaa aag ggc gec ttg gtt ttt acg get tec 
Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 
165 170 



175 



atg teg ggt cac ate gtt aac gtg ccg caa ttt cag gcg ace tac aat 
Met ser Gly His He Val Asn Val Pre Gin Phe Gin S ?hr Tyr Asn 



190 



235 



240 



•age tta gtt cca ttg ggc cgt ggt ggg gaa act gcg gaa tta gtt qqt 
Ser Leu Val Pro Leu Gly Arg Gly Gly. Glu Thr Ala Glu Leu Val Gly 
24 5 250 



255 



gee tac ctg ttc ctg gca agt gat gcg ggc tec tac gec acq qqc aca 
Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ila S SlJ ?hr 
2 60 265 



270 



gat ate att gtg gac ggc ggc tac acg ctg ccg taa tga 
Asp He lie Val Asp Gly Gly Tyr Thr Leu Pro * * 
27 5 x • 280 



<210> 46 
<211> 233 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> KRED krh2 8 7 



336 



384 



i30 



523 



576 



gcg gec aag gca ggc gtg cgt cat ttc gca aag tec ctg gec gtg gaa 6^4 
Ala Ala Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ila Va! Glu 
195 200 205 



S72 



ttt get cct ttc gca cgt gtt aac tct gta tct cct ggc tat att aat 
Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro lly Tyt ill Asn 
210 21 5 220 

ace gag- ate tct gat ttc gtc ccg caa gaa aca caa aat aaa tgg tgg 720 
Thr Glu -He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Tr? 



768 



816 



855 
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<400> 4b 



Met 


Ala 


Lys 


Asn 


Phe 


Ser 


Asn 


Val Glu Tyr Pro Ala Pro Pro Pro Ala 


1 








5 






10 15 


His 


Thr 


Lys 


Asn 


Glu 


Ser 


Leu 


Gin Val Leu Asp Leu Phe Lys Leu Asn 








20 








25 30 


Gly 


Lys 


Val 


Ala 


Ser 


He 


Thr 


Gly Ser Asn Ser Gly He Gly Tyr Ala 






35 










40 * 45 


Leu 


Ala 


Glu 


Ala 


Phe 


Ala 


Gin 


Val Gly Ala Asp Val Ala lie Trp Tyr 




50 










55 


60 


Asn 


Ser* 


His 


Asp 


Ala 


Thr 


Gly Lys Ala Glu Ala Leu Ala .Lys Lys Tvr 


65 










70 




75 80 


Glv 


Val 


Lys 


Val 


Lys 


Ala 


Tyr 


Lys Ala Asn Val Ser Ser Ser Asp Ala 










85 






90 95 


Val 


LiVS 


Gin 


Thr 


He 


Glu 


Gin 


Gin lie Lys Asp Phe Gly His Leu Asp 








100 








105 ~ no 


lie 


Val 


v aJ. 


Al a 


Asn 


ai a 


Gly 


He Pro Trp Thr Lys Gly Ala Tyr lie 






115 










120 125 


21 en 
nop 




Asp 


ASp 


Asp 


Lys 


His 


Phe Asp Gin Val Val Asp Val Asp Leu 




13 0 










135 


140 


T >\rzt 


nl w 


v cL J. 


wj, y 


Tyr 


Val 


Ala 


Lys His Ala Gly Arg His Phe Arg Glu 


145 










150 




155 " 160 


Arg 




pin 


Lys 


blU 


Lj_L y 


Lys 


Lys Gly Ala Leu Val Phe Thr Ala Ser 
















170 • 175 


Met- 
ric U 


Ser 


ul y 


nlb 


Tl o 
lie 


V=l 
Val 


Asn 


Val Pro Gin Phe Gin Ala Thr Tyr Asn 








180 








185 190 


Ala 


Ala 


Lys 


Ala 


Gly 


Val 


Arg 


His Phe Ala Lys Ser Leu Ala Val Glu 






195 










200 205 


Phe 


Ala 


Pro 


Phe 


Ala 


Arg 


Val 


Asn Ser Val Ser Pro Gly Tyr lie Asn 




210 










215 


220 


Thr 


Glu 


He 


Ser 


Asp 


Phe 


Val 


Pro Gin Glu Thr Gin Asn Lys Trp Trp 


225 










230 




235 240 


Ser 


Leu 


Val 


Pro 


Leu 


Gly 


Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 










245 






250 255 


Ala 


Tyr 


Leu 


Phe 


Leu 


Ala 


Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 








260 








265 270 


Asp 


He 


He 


Val 


Asp 


Gly 


Gly Tyr Thr Leu Pro 



275 280 



<210> 47 
<211> 855 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> KRED krh320 

<221> CDS 

<222> (1) . . . (855) 

<400> 47 

atg get aaa aac ttt age aat gtc gaa tac cct gec ccg ccg cca get 48 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala 
1 -5 10 15 

cat ace aaa aac gaa tea ctg cag gta ctg gat ctg ttc aaa ctg aac 96 
His Thr Lys Asn Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 
20 25 30 
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ggc aaa gtc gcg tct ate acc 'ggt age aac tea ggc att ggt tac gcg 144 
Gly Lys Val Ala Ser He Thr Gly Ser Asn Ser Gly He Gly Tyr Ala 
35 40 45 

etg gec gaa get ttt gcg cag gtt ggc gca gac gtt gcg ate tgg tat 192 
Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala He Trp Tyr 
50 55 so 



aac age cat gat gec acc ggt aaa gca gag gee ctg get aaa aaa tat 
Asn Ser His Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 
65 70 75 



80 



atg teg ggt cac ate gtt aac gtg ccg caa ttt cag gcg acc tac aat 
Met Ser Gly His He Val Asn Val Pro Gin Phe Gin Ala Thr Tyr Asn 
180 185 190 



acc gag ate tct gat ttc gtc ccg caa gaa aca caa aat aaa tgg tgg 
Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 
225 230 235 9 4 0 



240 



ggc gta aaa gtc aag get tat aaa get aat gtc age teg agt tat gcg 288 
Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Tyr Ala 
85 go 95 

gtg aaa cag act att gag cag cag ate aag gat ttt ggc cac ctg gac 33 6 

Val Lys. Gin Thr He Glu Gin Gin lie Lys Asp Phe Gly His Leu Asp 
100 105 no 

■ att gtt gtg gcg aac gca ggc ate cca tgg act aag ggt gca tac ate 334 
He Val Val Ala Asn Ala Gly He Pro Trp Thr Lys Gly Ala Tyr He 
115 120 " 125 

gat cag gat gac gat aaa cat ttt gac cag gtg gtg gac gtc gac ctg 432 
Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Asp Leu 
130 135 140 

•aaa ggc gta ggc tat gta gca aaa cat gcg ggt cgc cat ttt cgt gaa 480 
Lys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Phe Arg Glu 
145 " 150 155 160 

cgt tte gaa aaa gaa ggc aaa aag ggc gee ttg gtt ttt acg get tee 5 28 

Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 
165 170 175 



576 



gcg gec aag gca ggc gtg cgt cat ttc gca aag tec ctg gee gtg gaa 624 
Ala Ala Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 
195 200 205 

ttt get cct ttc gca cgt gtt aac tct gta tct cct ggo tat att aat 672 
Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr lie Asn 
210 215 220 



720 



age tta gtt cca ttg ggc cgt ggt ggg gaa act gcg gaa tta gtt agt 768 
Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 
245 250 255 



4S 
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gcc tac ctg ttc ctg gca agt gat gcg g gc tec tac gec acg ggc aca 
Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly *Ser Tyr Ala Thr Gly Thr 
260 265 270 



81b 



gat ate att gtg gac ggc ggc tac acg ctg ccg taa tga 
Asp lie lie Val.Asp Gly Gly Tyr Thr Leu Pro * * 
275 230 



855 



<210> 43 
<211> 283 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> KRED krh320 



<400> 48 



1*162 U 


Ala 


Lys 


Asn 


trie 


Ser 


ash vai uxu iyr rio AJ.a 


Pro Pro Pro Ala 


1 








r 

b 




10 


15 


11 J.a 


Thr 


Lys 


Asn 




Ser 


jjeu. oxxi vai ijeu Asp Lteu 


Phe Lys Leu Asn 








& U 






ZD 


3 0 




Lys 


Val 


nla 


Ser 


lie 


TVi >- 1 i/ Oar fieri Cay f~* ~\ t p 

lxii oiy Qci Asn aer oJ.y 


He Gly Tyr Ala 






35 








a n 


45 


Leu 


Ala 


Glu 


Ala 


On a 

rue 


A J. a. 


Pi n T7n 1 pi v Airs 7\ — ~ Tr_ 1 

uiii vol Kj±y AJ.a Asp vaj. 


Ala He Trp Tyr 




50 










55 60 




Asn 


Ser 


His 


Asp 


Ala 


Thr 


Gly Lys Ala Glu Ala Leu 


Ala Lys Lys Tyr 


65 










70 


75 


80 


Gly 


Val 


Lys 


Val 


Lys 


Ala 


Tyr Lys Ala Asn Val Ser 


Ser Ser Tyr Ala 










85 




90 


95 


Val 


Lys 


Gin 


Thr 


He 


Glu 


Gin Gin lie Lys Asp Phe 


Gly His Leu Asp 








100 






• 105 


110 


He 


Val' 


Val 


Ala 


Asn 


Ala 


Gly He Pro Trp Thr Lys 


Gly Ala Tyr He 






115 








120 


125 


Asp 


Gin Asp 


Asp 


Asp 


Lys 


His Phe Asp Gin Val Val 


Asp Val Asp Leu 




130 










135 140 




Lys 


Gly Val 


Gly 


Tyr 


Val 


Ala Lys His Ala Gly" Arg 


His Phe Arg Glu 


145 










150 


15 5" 


160 


Arg 


Phe 


Glu 


Lys 


Glu 


Gly 


Lys Lys Gly Ala Leu Val 


Phe Thr Ala Ser 










165 




170 


175 


Met 


Ser Gly 


His 


He 


Val 


Asn Val Pro Gin Phe Gin 


Ala Thr Tyr Asn 








180 






185 


190 


Ala 


Ala Lys 


Ala 


Gly 


Val 


Arg His Phe Ala Lys Ser 


Leu Ala Val Glu 






195 








200 


205 


Phe 


Ala 


Pro 


Phe 


Ala 


Arg 


Val Asn Ser Val Ser Pro 


Gly Tyr He Asn 




210 










215 220 




Thr 


Glu 


He 


Ser 


Asp 


Phe 


Val Pro Gin Glu Thr Gin 


Asn Lys Trp Trp 


225 










230 


235 


240 


Ser 


Leu Val 


Pro 


Leu 


Gly 


Arg Gly Gly Glu Thr Ala 


Glu Leu Val Gly 










245 




250 


255 


Ala 


Tyr Leu 


Phe 


Leu 


Ala 


Ser Asp Ala Gly Ser Tyr 


Ala Thr Gly Thr 








260 






265 


270 


Asp 


He 


He 


Val 


Asp 


Gly 


Gly Tyr Thr Leu Pro 








275 








280 





<210> 49 
<211> 852 . 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> KRED krh326 

<221> CDS 

<222>" (1) . . . (852) 

<400> 49 



12 lit ok" 390 9tC 933 tac cct occ c ^ =cg cca gcf 48 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro £ 

1 ' 5 « 15 



cat acc aaa aac gaa tea ctg cag gta ctg gat ctg ttc aaa eta «r 
H« Thr Lys Asn Glu Ser La, Gin Val Leu Lp Le ? J£ J* ™ 

20 2 5 



30 



ggc aaa gtc gcg tct ate acc ggt age aac tea ggc att ggt tac acq 
Gly Lys Val Ala Ser lie Thr Gly Ser Asn Ser Sly lie lly Tyr All 



45 



ctg gee gaa get ttt gcg cag gtt ggc gca gac gtt gcg ate tgg tat 
Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp val La He Tr| Tyr 



SO 



aac age cat gat gee acc ggt aaa gca gag gee ctg qct aaa aaa tat- 
Asn Ser His Asp Ala Thr Gly Lys Ala Glu Ala Leu La Lys Lys T yr 

ggc gta aaa gtc aag get tat aaa get aat gtc aqc tea aat a=,t- anr, 
Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Sel S Asp 111 
.85 90 95 

gtg aaa cag act att gag cag cag ate aag gat ttt qqc cac eta aar 
Val Lys Gin Thr lie Glu Gin Gin He Ly? Lp Phe lly Ss Leu. Asp 
100 105 110 

att gtt gtg gcg aac gca ggc ate cca tgg act aag got oca tac atr 
He Val Val Ala Asn Ala Gly lie Pro Trp Thr Lys lly La j£ Til 
115 120 125 

gat cag gat gac gat aaa cat ttt gac cag gtg gtg gac gtc gac ctq w 
Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Va! Lp Val Sp Leu 
130 13S 140 

aaa ggc gta ggc tat gta gca aaa cat gcg ggt cgc cat ttt cct aaa 
jys Gly Val Gly Tyr Val Ala Lys His La lly Arg His Phe Arg III 
145 150 155 



160 



cgt ttc gaa aaa gaa ggc aaa aag ggc gec ttq att ttt arrr ant- hr^ 
Arg Phe Glu Lys Glu Gly Lys Lys g y La Leu Val Phe Thr La Ser 



175 



atg teg ggt cac ate gtt aac gtg ccg caa ttt cag ccq acc tac aat 
Met Ser Gly His He Val Asn Val Pro Gin Phe Gin Ah ?hr Tyr Tsl 
180 185 190 



96 



144 



192 



240 



288 



336 



384 



480 



528 



576 



50 
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gcg gtc aag gca ggc gtg cgt cac ttc gca aag tec ctg gec gtg gaa 624 
Ala Val Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 
195 200 205 

ttt get cct ttc gca cgt gtt aac tct gta tct cct ggc tat att aat 672 
Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr He Asn 
210 215 220 



acc gag ate tct gat ttc gtc ccg caa gaa aca caa aat .aaa tgg tgg 
Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 
225 230 235 ' 240 



720 



age tta gtt cca ttg ggt cgt ggt ggg gaa act gcg gaa tta gtt ggt 768 
Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 

245 250 255 

gec tac ctg ttc ctg gca agt gat gcg ggc tec tac gee acg ggc aca 816 

Ala Tyr Leu Phe Leu .Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 
260 ^ 265 * 270 

gat ate att gtg gac ggc ggc tac acg ctg ccg taa 852 

Asp- He He Val . Asp Gly Gly Tyr Thr Leu Pro * 
275 280 



<210> 50 
<211> 283 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> KRED krh3 2 6 
<400> 50 

Met "Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala 

15 10 15 

His Thr Lys Asn Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 

20 25 30 

Gly Lys Val Ala Ser He Thr Gly Ser Asn Ser Gly He Gly Tyr Ala 

35- 40 45 ' 

Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala He Trp Tyr 

50 55 * 60 

Asn Ser His Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 
65 7 0 75 ~ " 8 0 

Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 

85 90 95 

Val Lys Gin Thr He Glu Gin Gin He Lys Asp Phe Gly His Leu Asp 

100 105 ~ 110 

He Val Val Ala' Asn Ala Gly He Pro Trp Thr Lys Gly Ala Tyr He 

115 120 125 

Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Asp Leu 

130 135 140 ' 

Lys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Phe Arg Glu 
145 150 155 160 

Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 
' 165 170 175 



51 
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Met Ser Gly His lie Val Asn Val Pro Gin Phe Gin Ala Thr Tyr Asn 

180 18 5 190 

Ala Val Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 

195 200 205 

Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr lie Asn 

210 215 2 ->Q 

Thr Glu lie Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 

o 230 2 35 240 

Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 

245 250 ?55 

-la Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 

260 265 270 

Asp He He Val Asp Gly Gly Tyr Thr Leu Pro 
275 280 

<210> 51 
<211> 852 
<212> DNA 

<213> Artificial Sequence 
<22Q> 

<223> KRED-krh408 



<221> CDS 

<222> (1) . . . (852) 



<400> 51 

atg get aaa aac ttt age aat gtc gaa tac cct gec ccg ccg cca get 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala 
1 5 10 15 

oat acc aaa aac gaa tea ctg cag gta etg gat ctg ttc aaa ctg aac 

Has Thr "Lys Asn Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 



20 25 



30 



48 



9b 



ggc aaa gtc gcg tct ate acc ggt age aac tea ggc att ggt tac gcg 144 

Gly Lys Val Ala Ser lie Thr Gly Ser Asn Ser Gly He Gly Tyr Ala 
35 40 ■ 45 

ctg gec gaa get ttt gcg cag get ggc gca gac gtt gcg ate tgg tat m 

Leu Ala Glu Ala Phe Ala Gin Ala Gly Ala Asp Val 111 He rtl T Jr 
' 50 55 60 .- y 



240 



283 



aac age cat gat gec ace ggt aaa gca gag gee ctg get aaa aaa tat 
Asn Ser His Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 
65 70 75 go 

ggc gta aaa gtc aag get tat aaa get aat gtc age teg agt gat gcq 
^ly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 
35 90 95 

gtg aaa cag act att gag cag cag ate aag gat ttt ggc cac ctg gac 336 
Val Lys Gin Thr He Glu Gin Gin He Lys Asp Phe G?y His Leu Asp 
100 105 110 

att gtt gtg gcg aac gca ggc ate cca tgg act aag ggt qca tac ate 3 34 

lie Val val Ala Asn Ala Gly He Pro Trp Thr Lys 3ly fla Tyr He 



52 
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115 120 125 

gat cag gat gac gat aaa cat ttt gac cag gtg gtg gac gtc gac ctg 432 
Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Asp Leu 
130 135 140 

aaa ggc gta ggc tat gta gca aaa cat gcg ggt cgc cat ttt cgt gaa 480 
Lys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Phe Arg Glu 
145 150 155 160 

cgt tec gaa aaa gaa ggc aaa aag ggc gec ttg gtt ttt acg get tec 528 
Arg Ser Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 
165 170 175 

atg teg ggt cac ate gtt aac gtg ccg caa ttt cag gcg acc tac aac 576 
Met Ser Gly His lie Val Asn Val Pro Gin Phe Gin Ala Thr Tyr Asn 
180 185 190 

gcg gtc aag" gca ggc gtg cgt cat ttc gca aag tec ctg gec gtg gaa ' 624 
Ala Val Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 
195 200 .205 

ttt get cct ttc gca cgt gtt aac tct gta tct cct ggc tat att aat 672 
Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr lie Asn 
' 210 215 • 220 

acc gag ate tct gat ttc gtc ccg caa gaa aca caa aat aaa tgg tgg 72 0 

Thr Glu lie Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 
225 230 235 240 

age tta gtt cca ttg ggt cgt ggt ggg .gaa act gcg gaa tta gtt ggt 768 
Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 
245 250 255 

gec tac ctg ttc ctg gca agt gat gcg ggc tec tac gee acg ggc aca 816 
Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 
260 265 270 

gat ate att gtg gac ggc ggc tac acg ctg ccg taa 852 
Asp lie lie Val Asp Gly Gly Tyr Thr Leu Pro * 
275 280 



<210> 52 
<211> 283 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> KRED krh408 
<400> 52 

Met Ala Lys Asn Phe Ser Asn Val 

1 5 
His Thr Lys Asn Glu Ser Leu Gin 
20 

Gly Lys Val Ala Ser He Thr Gly 



Glu Tyr Pro Ala Pro Pro Pro Ala 

10 15 
Val Leu Asp Leu Phe Lys Leu Asn 
25 30 
Ser Asn Ser Gly He Gly Tyr Ala 
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35 40 45 

Leu Ala Glu Ala Phe Ala Gin Ala Gly Ala Asp Val Ala He Trp Tvr 

50 55 60 * 

Asn Ser .His Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 
65 ™ 75 ~ 80 

Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 

85 "90 95 

Val Lys Gin Thr He Glu 'Gin Gin He Lys Asp Phe Gly- His Leu Asr> 
100 105 no " 

He Val Val Ala Asn Ala Gly He Pro Trp Thr Lys Gly Ala Tyr He 

115 120 125 

Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Asp Leu 

130 135 140 

Lys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Phe Arg Glu 
145 150 155 " leo 

Arg Ser Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 

165 170 175 

Met Ser Gly His lie Val Asn Val Pro Gin Phe Gin Ala Thr Tyr Asn 

180 185 190 ' 

Ala Val Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 

195 200 205 

• Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr He Asn 

210 215 220 

Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp. Trp 
225 230 235 240 

Ser Leu Val Pro -Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 

245 250 255 

Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 

260 265 270 

Asp He He Val Asp Gly Gly Tyr Thr Leu Pro 
275 280 

<210> 53 
<211> 852 
.<212> DNA 

<213> Artificial Sequence 
<220> 

<223> KRED krh4l7 

<221> CDS 

<222> (1) . . . (852) 

<400> 53 

atg get aaa aac ttt age aat gtc gaa tat cct gec ccg ccg cca get 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala 

15 10 15 



48 



cat acc aaa aac gaa tea ctg cag gta ctg gat ctg ttc aaa ctg aac 96 

His Thr Lys Asn Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 

20 25 " 30 

ggc aaa gtc gcg tct ate acc ggt age aac tea ggc att ggt tac gcg 144 

Gly Lys Val Ala Ser He Thr Gly Ser Asn Ser Gly lie Gly Tyr Ala 
35 40 45 

ctg gee gaa get ttt gcg cag gtt ggc gca gac gtt gcg ate tgg tat 192 



54 
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Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala lie Trp Tyr 
50 55 60 

aac age cat gat gec acc ggt aaa gca gag gec ctg get aaa aaa tat 

Asn Ser His Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 

65 70 75 80 

ggc gta aaa gtc aag get tat aaa get aat gtc age teg agt gat gcg 

Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 

85 90 95 



240 



288 



• gtg aaa cag act att gag cag cag ate aag gat ttt ggc cac ctg gac 336 
Val Lys Gin Thr He Glu Gin Gin He -Lys Asp Phe Gly His Leu Asp 
100- 105 no 

att gtt gtg gcg aac gca ggc ate cca tgg act aag ggt gca tac ate 3 84 

He Val Val Ala Asn Ala Gly He Pro Trp Thr Lys Gly Ala Tyr lie 
115 120 ~ ' 125 

gat cag gat gac gat aaa cat ttt gac cag gtg gtg gac gtc gac ctg 432 
Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Asp Leu 
130 135 140 

aaa ggc gta ggc tat gta gca aaa cat gcg ggt cgc cat ttt cgt gaa 4 80 

Lys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Phe Arg Glu 
145 150 155 iso 

cgt ttc gaa aaa gaa ggc aaa aag ggc gec ttg gtt ttt acg get tec 528 
Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 
165 170 175 

atg teg ggt cac ate gtt aac att ccg caa ttt cag gcg acc tac aat 576 
Met Ser -Gly His He Val Asn He Pro Gin Phe Gin Ala Thr Tyr Asn 
180 185 190 

gcg gee aag gca ggc gtg cgt cat ttc gca aag tec ctg gee gtg gaa 624 
Ala Ala Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu- Ala Val Glu 
195 200 205 

ttt get cct ttc gca cgt gtt aac tct gta tct cct ggc tat att aat " 672 
Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr He Asn 
210 215 220 

acc gag ate tct gat ttc gtc ccg caa gaa aca caa aat aaa tgg tgg 720 
Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 
225 230 235 240 

age tta gtc cca ttg ggt cgt ggt ggg gaa act gcg gaa tta gtt ggt 768 
Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 
245 250 255 

gee tac ctg ttc ctg gca agt gat gcg ggc tec tac gec acg ggc aca 816 
Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 
260 265 ^ 270 

gat ate att gtg gat ggc ggc tac acg ctg ccg taa 8 52 

Asp He He Val Asp Gly Gly Tyr Thr Leu Pro * 
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275 280 



<210> 54 
<211> 283 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> KRED krh417 



<400> 54 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala 

} 5 10 15 

His Thr Lys Asn Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 

20 25 - 30 

Gly Lys Val Ala Ser lie Thr Gly Ser Asn Ser Gly He Gly Tyr Ala 

35 40 45 

Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala He Trp Tyr 

50 55 60 • 

Asn Ser His As P Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 

Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 

85 90 95 

Val Lys Gin Thr lie Glu Gin Gin lie Lys Asp Phe Gly His Leu Asp 
100 105 110 *■ 

lie Val Val Ala Asn Ala Gly He Pro Trp Thr Lvs Gly Ala Tyr lie 

H5 120 1?5 . 

Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Asp Leu 

130 135 140 

Lys bly Val Gly Tyr Val Ala Lys His Ala Gly. Arg His Phe Arg Glu 

I - , 155 160 

Arg Phe ulu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 

Met Ser Gly His He Val Asn He Pro Gin Phe Gin Ala Thr iyr Asn 

130 las 19Q 

Ala Ala Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 

195 200 205 

Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr lie Asn 

210 215 220 

Thr Glu He Ser As P Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 

230 235 240 

Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 

245 250 955 

Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 

260 265 - o 70 

Asp He He Val Asp Gly Gly Tyr Thr Leu Pro 
275 280 

<210> 55 
<211> 852 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> KHED krh483 
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<221> CDS 

<222> (1) . . . (852) 

<400> 55 

atg get aaa aac ttt tec aat gtc gaa tat cct gec ccg ccg cca get 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala 
1 5 io Is- 



aac age cat gat gee ace ggt aaa gca gag gee ctg get aaa aaa tat 
Asn Ser His Asp Ala- Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 
65 70 75 "so 



gat cag gat gac gat aaa cat ttt gac cag gtg gtg gae gtc gac ctg. 
Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val' Val Asp Val Asp Leu 
130 135 140 



48 



cat acc aaa aac gaa tea ctg cag gta ctg gat ctg ttc aaa ctg aac 96 
His Thr Lys Asn Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 
20 25 " 30 

ggc aaa gtc gcg tct ate acc ggt age aac tea ggc att ggt tae gcg 144 
Gly Lys Val Ala Ser lie Thr Gly Ser Asn Ser Gly He Gly Tyr Ala 
35 40 45 

ctg gee gaa get ttt gcg cag gtt ggc gca gac gtt gcg ate tgg tat 192 
Leu Ala Glu Ala Phe Ala Gin "Val Gly Ala Asp Val Ala He Trp Tyr 
50 55 60 



240 



ggc gta aaa gtc aag get tat aaa get aat gtc age teg agt gat gcg 288 

Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 
85 90 95 

gtg aaa cag act att gag cag cag ate aag gat ttt ggc cac ctg gac 336 

Val Lys Gin Thr He Glu Gin Gin He Lys Asp Phe Gly His Leu Asp 
100 105 no 

ata gtt "gtg gcg aac gca ggc ate cca tgg act aag ggt gca tae ate 3 84 

lie Val Val Ala Asn Ala Gly He Pro Trp Thr Lys Gly Ala Tyr lie 
115 120 125 



43: 



aaa ggc gta ggc tat gta gca aaa cat gcg ggt cgc cat ttt cgt gaa 430 
Lys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Phe Arg Glu 
145 150 155 160 

cgt ttc gaa aaa gaa ggc aaa aag ggc gee ttg gtt ttt acg get tec 528 
Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 
165 170 175 

atg teg ggt cac ate gtt aac gtg ccg caa ttt cag gcg acc tac aat - 576 
Met Ser Gly His He Val Asn Val Pro Gin Phe Gin Ala Thr Tyr Asn 
180 185 190 

gcg gtc aag gca ggc gtg cgt cat ttc gca aag tec ctg gec gtg gaa 624 
Ala Val Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 
195 200 205 

ttt get cct ttc gca cgt gtt aac tct gta tct cct ggc tat att aat 672 
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Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr He Asn 
210 215 220 

acc gag ate tct gat ttc gtc ccg caa gaa aca caa aat aaa tgg tgg 7?o 
Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 
225 230 235 240 

age tta gtt cca ttg ggc cgt ggc ggg gaa act gcg qaa tta gtt ggt 768 
Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 
245 250 255 

gec tac ctg ttc ctg gca agt gat gcg ggc tec tac gec acg ggc aca 816 
Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 
260 265 270 

gat ate att gtg gac ggc ggc tac acg ctg ccg taa 8^2 
Asp He He Val Asp Gly Gly Tyr Thr Leu Pro * 
275 280 



<210> 56 
<211> 283 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> KRED krh4S3 
<400> 56 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala 

15 10 15 

His Thr Lys Asn Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 
20 25 30 " 

Gly Lys Val Ala Ser lie Thr Gly Ser Asn Ser Gly He Gly Tyr Ala 

35 40 45 

Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala He Trp Tyr 

50 55 so 

Asn Ser His Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 
65 7 0 75 "* • 80 

Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 

85 90 95 

Val Lys Gin Thr He Glu Gin Gin He Lys Asp Phe Gly His Leu Asp 

100 105 " no 

He Val Val Ala Asn Ala Gly He Pro Trp Thr Lys Gly Ala Tyr He 

115 120 ' 125 

Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Asp Leu 

130 135 140 

Lys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Phe Arg Glu 
145 150 155 160 

Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 

165 170 175 

Met Ser Gly His He Val Asn Val Pro Gin Phe Gin Ala Thr Tyr Asn 

180 185 190 

Ala Val Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 

195 200 205 

Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr He Asn 
210 . 215 220 
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Thr 


Glu He 


Ser Asp 


Phe Val 


Pro 


225 








230 




Ser 


Leu Val 


Pro 


Leu 


Gly Arg 


Gly 








245 






Ala 


Tyr Leu 


Phe 


Leu 


Ala Ser 


Asp 






260 








Asp 


He He 


Val 


Asp 


Gly Gly 


Tyr 




275 








230 



<210> 57 
<2H> 852 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> KRED krh476 

<221> CDS 

<222> (1) . . . (852) 



Gin Glu Thr Gin Asn Lys Trp Tro 
235 240 
Gly Glu Thr Ala Glu Leu Val Gly 

250 255 
Ala Gly Ser Tyr Ala Thr Gly Thr 
265 270 
Thr Leu Pro 



<400> 57 

atg get aaa aac ttt tec aat gtc gaa tat cct gec ccg ccg cca get 48 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala 
1.5 10 15 



cat acc aaa gac gaa tea ccg cag gta ctg gat ctg ttc aaa ctg aac 96 
His Thr Lys Asp Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 
20 25 30 



gge aaa gtc gcg tct ate acc ggt age aac tea ggc att ggt tae gcg 144 
Gly Lys Val A.la Ser lie Thr Gly Ser Asn Ser Gly lie Gly Tyr Ala 
35 40 45 

ctg gee gaa get ttt gcg cag gtt ggc gca gac gtt gcg ate tgg tat 192 
Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala He Trp Tyr 
50 55 " 60 

aac age cat gat gee acc ggt aaa gca gag gee ctg get aaa aaa tat 240 
Asn Ser His Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 
65 70 75 ' 80 



ggc gta aaa gtc aag get tat aaa get aat gtc age teg agt gat gcg 28 8 

Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 
35 90 95 

gtg aaa cag act att gag cag cag ate aag gat ttt ggc cac ctg gac 33 6 

Val Lys Gin Thr He Glu Gin Gin He Lys Asp Phe Gly His Leu Asp 
100 105 110 



att gtt gtg gcg aac gca ggc ate cca tgg act aag ggt gca tae ate 3 84 

He Val Val Ala Asn Ala Gly He Pro Trp Thr Lys Gly Ala Tyr He 

115 120 125 

gat cag gat gac gat aaa cat ttt gac cag gtg gtg gac gtc gac ctg 43 2 

Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Asp Leu 

130 135 140 * ■ 
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aaa ggc gta ggc tat gta gcg aaa cat gcg ggt cgc cat ttt cgt qaa 
Lys Gly Val Gly Tyr Val Ala Lys Kis Ala Gly Arg His Phe Arg Glu 
145 150 155 160 

cgt ttc gaa aaa gaa ggc aaa aag ggc gcc ttg gtt ttt acg get tec 
Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 



165 170 



175 



190 



195 200 



205 



gat atc'att gtg gac ggc ggc tac acg ctg ccg taa 
Asp lie lie Val Asp Gly Gly Tyr Thr Leu Pro * 
275 230 



480 



528 



atg teg ggt cac ate gtt aac gtg ccg caa ttt cag gcg acc tac a-t 576 
Met Ser Gly His lie Val Asn Val Pro Gin Phe Gin Ala Thr Tvr Asn 
180 185 * 



gcg gtc aag gca ggc gtg cgt cat ttc gca aag tec ctg gcc gtg gaa 6^4 
Ala Val Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 



ttt get cct ttc gca cgt gtt aac tct gta tct cct ggc tat att aat 672 
Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr He Asn 
210 215 220 



720 



768 



acc gag ate tct gat ttc gtc ccg caa gaa aca cag aat aaa tgg tgg 
•Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Tro 
225 230 235 2 40 

age tta gtt cca ttg ggc cgt ggt ggg gaa act gcg gaa tta gtt ggt 
Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 
245 250 255 

gcc tac ctg ttc ctg gca agt gat gcg ggc tec tac gee acg ggc aca 816 
Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tvr Ala Thr Gly Thr 
260 265 * 270 



852 



<210> 58 
<211> 283 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> KRED krh4 7 6 



<400> 58 










Met Ala 
1 


Lys 


Asn 


Phe 


Ser 


Asn Val 


His Thr 


Lys 


Asp 


5 

Glu 


Ser 


Leu Gin 






20 








Gly Lys 


Val 


Ala 


Ser 


He 


Thr Gly 




35 








40 


Leu Ala 


Glu 


Ala 


Phe 


Ala 


Gin Val 


50 










55 


Asn Ser 


Kis 


Asp 


Ala 


Thr 


Gly -Lys 


65 








70 


Gly Val 


Lys 


Val 


Lys 


Ala 


Tyr Lys 



Glu Tyr Pro Ala Pro Pro Pro Ala 

10 15 
Val Leu Asp Leu Phe Lys Leu Asn 
25 30 
Ser Asn Ser Gly He Gly Tyr Ala 
4 5 

Gly Ala Asp Val Ala He Trp Tyr 
60 

Ala Glu Ala Leu Ala Lys Lys Tyr 

7 5 ' 80 

Ala Asn Val Ser Ser Ser Asp Ala 
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85 90 95 



Val 


Lys 


Gin 


Thr 


He Glu Gin 


Gin He Lys Asp 


Phe Gly His Leu Asp 








100 




105 


110 


lie 


Val 


Val 


Ala Asn Ala Gly 


He Pro Trp Thr 


Lys Gly Ala Tyr lie 






115 






120 


125 


Asp 


Gin 


Asp 


Asp Asp Lys His 


Phe Asp Gin Val 


Val Asp Val Asp Leu 




130 






135 




140 


Lys 


Gly 


Val 


Gly Tyr Val Ala 


Lys His Ala Gly 


Arg His Phe Arg Glu 


145 








150 


155 


160 


Arg 


Phe 


Glu 


Lys 


Glu Gly Lys 


Lys Gly Ala Leu 


Val Phe Thr Ala Ser 










165 


170 


175 


Met 


Ser 


Gly 


His 


He Val Asn 


Val Pro Gin Phe 


Gin Ala Thr Tyr Asn 








180 




185 


190 


Ala 


Val 


Lys 


Ala 


Gly Val Arg 


His Phe Ala Lys 


Ser Leu Ala Val Glu 






195 






200 


205 


Phe 


Ala 


Pro 


Phe 


Ala Arg Val 


Asn' Ser Val Ser 


Pro Gly Tyr He Asn 




210 






215 




220 


Thr 


Glu 


He 


Ser 


Asp Phe Val 


Pro Gin Glu Thr 


Gin Asn Lys Trp Trp 


225 








230 


235 


240 


Ser 


Leu 


Val 


Pro 


Leu Gly Arg 


Gly Gly Glu Thr 


Ala Glu Leu Val Gly 










245 


250 


255 


Ala 


Tyr 


Leu 


Phe 


Leu- Ala Ser 


Asp Ala Gly Ser 


Tyr Ala Thr Gly Thr 








260 




265 


270 


Asp 


He 


He 


Val 


Asp Gly Gly 


Tyr Thr Leu Pro 








275 






280 





<210> 59 
<211> 852 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> KRED * kr h4 9 5 

<221> CDS 

<222> (1) . . . (852) 

<400> 59 

atg get aaa aac ttt tec aat gtc gaa tat cct gec ccg ccg cca get 48 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala 
1^*5 10 15 

cat acc aaa gac gaa tea ctg cag gta ctg gat ctg ttc aaa ctg aac 96 
His Thr Lys Asp Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 
20 25 30 

ggc aaa gtc gcg tct ate acc ggt age age tea ggc att ggt tac gcg 144 
Gly Lys Val Ala Ser lie Thr Gly Ser Ser Ser Gly He Gly Tyr Ala 
35 -40 45 

ctg gee gaa gee ttt gcg cag gtt ggc gca gac gtt gcg ate tgg tat 192" 
Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala lie Trp Tyr 
50 55 60 

aac age cag gat gec acc ggt aaa gca gag gee ctg get aaa aaa tat 24 0 

Asn Ser Gin Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 
65 70 75 80 
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% SS S !S "I S £ £ E £ S £ S2 E 5 H 283 

85 so 9s 



gtg aaa cag act att gag cag cag ate aag gat ttt qqc cac eta a »r 
Val Lys Gin Thr lie Glu Gin Gin He Lys Lp Phe l?y Sis Leu fsp 



110 



att gtt gtg gcg aac gca ggc ate cca tgg act aaq qqt aca tac stn 
■He Val. Val Ala Asn Ala Gly lie Pro T?p Thr Ly i §? y K ^ 
115 120 



125 



gat cag gat gac gat aaa cat ttt gac cag gtg att gac gtc gac ctq 
Asp Gin Asp Asp .Asp Lys His Phe Asp Gin Va! lie Lp Val Lp Leu 



140 



aaa ggc gta ggc tat gta gca aaa cat gcg qqt cac cat Vat- ™t- 
Lys Gly Val Gly Tyr Val Ala Lys His 111 lf y J£ His Tyr Sg Ilu 



155 



160 



cgt ttc gaa aaa gaa ggc ata aag ggc g CC ttg att ttt acg get tec 
Arg Phe Glu Lys Glu Gly He Lys Gly Ala Leu lie Phe Thr Ala Ser 



165 170 



175 



gtg teg ggt cac ate gtt aac att ccg caa ttt cag gcg acc tac aat 
Val Ser Gly His lie Val Asn lie Pro Gin Phe Gin 111 S£ j£ *sn 



180 



190 



gcg gec aag gca ggc gtg cgt cat ttc gca aag tec ctg gec gtg gaa 
Ala Ala Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu La ?,! SI 



195 200 



205 



ttt get cct ttc gca cgt gtt aac tct gta tct cet ggc tat att aat 
Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gxy Tyr lie Asn 
210 215 220 

Thr r? 9 f ' Ct ? at " C 9tC CCS Caa 9aa aca caa aat aaa tgg tgg 
Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp T^p 



235 



240 



HI " a 2" ° Ca ttS 9gt e 9 fc 99 1 939 9aa act gcg gaa tta gtt ggt 
Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val 111 
245 - - - 



250 255 



gec tac ctg ttc ctg gca agt gat gcg ggc tec tac gec acg gqc aca 
Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr La Thr Gxy xhr 



260 265 

gat ate att gtg gat ggc ggc tac acg ctg ccg taa 

Asp He He Val Asp Gly Gly Tyr Thr Leu Pro * 
275 280 



<210> 60 
<211> 283 
<212> PRT 



270 



336 



384 



432 



480 



528 



576 



624 



672 



720 



763 



816 



85: 



62 
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<213> Artificial Sequence 
<220> 

<223> KRED krh495 
<400> 60 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Pro Pro Ala 

1 5 10 15 

His Thr Lys Asp Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 

20 25 .30 

Gly Lys Val Ala Ser lie Thr Gly Ser Ser Ser Gly He Gly Tyr Ala 

35 40 45 

Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala He Trp Tyr 

50 55 60 

Asn Ser Gin Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 
65 70 75 "* ~ 80 

Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 

85 • • 90 95 

Val Lys Gin Thr He Glu Gin Gin He Lys Asp Phe Gly His Leu Asp 

100 105 no 

He Val Val Ala Asn Ala Gly He Pro Trp Thr Lys Gly Ala Tyr He 

115 120 125 

Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val He Asp Val Asp Leu 

130 135. 140 

Lys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Tyr Arg Glu 
145 150 155 ~ ' 160 

Arg Phe Glu Lys Glu Gly He Lys Gly Ala Leu lie Phe Thr Ala Ser 

165 170 175 

Val Ser Gly. His He Val Asn He Pro Gin Phe Gin Ala Thr Tyr Asn 

180 1 135 130 

Ala Ala Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 

195 200 < 205 

Phe Ala' Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr lie Asn 

210 215 220 

Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 
225 " 230 ' 235 " * 240 

Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 

245 250 255 

Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 

260 ~ 265 270 

Asp lie lie Val Asp Gly Gly Tyr Thr Leu Pro 
275 280 

<210=» 61 
<211> 789 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> GDH 2313 

<221> CDS 

<222> {!)... (789) 

<400> 61 

atg tat ccg gat tta aaa gga aaa gtc gtc get att aca gga get get 48 
Met Tyr Pro Asp Leu Lys Gly Lys Val Val Ala He Thr Gly Ala Ala 
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10 15 



tea ggg etc gga aag gcg atg gec att cgc ttc ggc aag gag cag qca 
Ser Gly Leu Gly Lys Ala Met Ala He Arg Phe Gly Lys Glu Gin Ala 
20 25 



30 



aaa gtg gtt ate aac tat tat agt aat aaa caa gat ccg aac gag qta 
Lys Val Val He Asn Tyr Tyr Ser Asn Lys Gin Asp Pro- Asn Glu Val 
3 5 4 0 



45 



aaa gaa gag gtc ate aag gcg ggc ggt gaa get gtt gtc gtc caa qqa ' 
Lys Glu Glu Val lie Lys Ala Gly Gly Glu Ala Val Val Val Gin Glv 
5 0 55 * 



60 



gat gtc acg aaa gag gaa gat gta aaa aat ate gtg caa acg gca att 
Asp Val Thr Lys Glu Glu Asp Val Lys Asn He Val Gin Thr Ala He 
65 70 



75 80 



aag gag ttc ggc aca etc gat att atg att aat aat gec ggt ctt gaa 
Lys Glu Phe Gly Thr Leu Asp He Met He Asn Asn Ala Gly Leu Glu 



85 - 90 



95 



100 105 



110 



125 



aaa tat ttc gta gaa aac gat ate aag gga aat gtc att aac atg tec 
Lys Tyr Phe Val Glu Asn Asp He Lys Gly Asn Val He Asn Met Ser 



130 •• 135 



140 



agt gtg cac gaa gtg att cet tgg ccg tta ttt gtc cae tat gcg gca 
Ser Val His Glu Val He Pro Trp Pro Leu Phe Val His Tyr Ala -Ala 

1 4 R -ten J 



145 150 155 



160 



agt aaa ggc ggg atg aag ctg atg aca gaa aca tta gcg ttg gaa tac 
Ser Lys Gly Gly Met Lys Leu Met Thr Glu Thr Leu Ala Leu Glu Tyr 



155 170 



175 



190 



acg acg ate aat get gag aaa ttt get gac cet aaa cag aaa get gat 
Thr Thr He Asn Ala Glu Lys Phe Ala Asp Pro Lys Gin Lys Ala Asd 
195 200 * * 



205 



210 215 



220 



gee gca gta gca gee tgg etc get teg aag gaa gec age tac gtc aca 
Ala Ala Val Ala Ala Trp Leu Ala Ser Lys Glu Ala Ser Tyr Val Thr 
225 230 235 - 240 



96 



144 



192 



240 



288 



aat cet gtg cca tct cac gaa atg ccg etc aag gat tgg gat aaa gtc 31s 
Asn Pro Val Pro Ser His Glu Met Pro Leu Lys Asp Trp Asp Lys Val 



ate ggc acg aac tta acg ggt gee ttt tta gga age cgt gaa gcg att 38* 
He Gly Thr Asn Leu Thr Gly Ala Phe Leu Gly Ser Arg Glu Ala He 
H5 120 



43: 



480 



528 



gcg ccg aag ggc att cgc gtc aat aat att ggg cca ggt gcg ate aac 576 
Ala Pro Lys Lily He Arg Val Asn Asn He Gly Pro Gly Ala lie A^n 
18 0 185 



624 



ff, ^ 9<= at9 att ° Ca at9 " a tat atC sgc 9 aa cc 9 Sag gag ate 6 7^ 

Val Glu Ser Met lie Pro Met Gly Tyr He Gly Glu Pro Glu Glu He 



720 
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ggc ate acg tta ttc gcg gac ggc ggt atg aca caa tat cct tea ttc 768 
Gly He Thr Leu Phe Ala Asp Gly Gly Met Thr Gin Tyr Pro Ser Phe 
245 250 * 255 

cag gca ggc cgc ggt taa tga 789 
Gin Ala Gly Arg Gly * * 
260 



<210> 62 
<211> 261 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> GDH 2313 
<400> 62 



Met 


Tyr 


Pro 


Asp 


Leu 


Lys 


Gly Lys Val 


Val Ala lie Thr 


Gly. Ala Ala 


1 
















XL) 


15 


Ser Gly 


Leu 


Gly 


Lys- 


Ala 


Met 


Ala He 


Arg Phe Gly Lys 


Glu Gin Ala 








20 








25 




30 


Lys 


Val 


val 


He 


Asn 


Tyr 


Tyr 


Ser Asn 


Lys Gin Asp Pro 


Asn Glu Val 






35 










40 


45 




Lys 


Glu 


Glu 


Val 


He 


Lys 


Ala Gly Gly 


Glu Ala Val Val 


Val Gin Gly 




50 










55 




60 




Asp 


Val 


Thr 


Lys 


Glu 


Glu 


Asp 


Val Lys 


Asn He Val Gin 


Thr Ala lie 


65 










70 






75 


80 


Lys 


Glu 


Phe 


Gly 


Thr 


Leu 


Asp 


He Met 


lie Asn Asn Ala 


Gly Leu Glu 










85 








90 


95 


Asn 


Pro 


Val 


Pro 


Ser 


His 


Glu 


Met Pro 


Leu Lys Asp Trp 


Asp Lys Val 








100 








105 




110 


He Gly Thr 


Asn 


Leu 


Thr 


Gly Ala Phe 


Leu Gly Ser Arg 


Glu Ala He 






115 










120 


125 




Lys 


Tyr 


Phe 


Val 


Glu 


Asn 


Asp 


He Lys 


Gly Asn Val lie 


Asn Met Ser 




130 










135 




140 




Ser 


Val 


His 


Glu 


Val 


He 


Pro 


Trp Pro 


Leu Phe Val His 


Tyr Ala Ala 


145 










150 






155 


160 


Ser 


Lys 


Gly 


Gly 


Met 


Lys 


Leu 


Met Thr 


Glu Thr Leu Ala 


Leu Glu Tyr 










165 








170 


175 


Ala 


Pro 


Lys 


Gly 


He 


Arg 


Val 


Asn Asn 


He Gly Pro Gly 


Ala lie Asn 








180 








185 




190 


Thr 


Thr 


He 


Asn 


Ala 


Glu 


Lys 


Phe. Ala 


Asp Pro Lys Gin 


Lys Ala Asp 






195 










200 


205 




Val 


Glu 


Ser 


Met 


He 


Pro 


Met 


Gly Tyr 


He Gly Glu Pro 


Glu Glu lie 




210 










215 




220 




Ala 


Ala 


Val 


Ala 


Ala 


Trp 


Leu 


Ala Ser 


Lys Glu Ala Ser 


Tyr Val Thr 


225 










230 






235 


240 


Gly 


He 


Thr 


Leu 


Phe 


Ala 


Asp Gly Gly 


Met Thr Gin Tyr 


Pro Ser Phe 



245 250 255 

Gin Ala Gly Arg Gly 
260 

<210> 63 ' 
<211> 789 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> GDH 2331 

<221> CDS 

<222> (1) . . . (789) 

<400> 63 

atg tat ccg gat tta aaa gga aaa gtc gtc get att aca gga get get 
Met Tyr Pro Asp Leu Lys Gly Lys Val Val Ala lie Thr Gly Ala Ala 
1.5 10 1S 



ISO 



48 



tea ggg etc gga aag gcg atg gec att cgc ttc ggc aag gag cag gca 96 
Ser Gly Leu Gly Lys Ala Met Ala lie Arg Phe Gly Lys 'Glu Gin Ala 
20 25 * " 30 

•aaa gtg gtt ate aac tat tat agt aat aaa caa gat ccg aac gag gta 144 
Lys Val Val He Asn Tyr Tyr Ser Asn Lys Gin Asp Pro Asn Glu Val 
35 40 45 

aaa gaa gag gtc ate aag gcg ggc ggt gaa get gtt gtc gtc caa gga 192 
Lys Glu Glu Val He Lys Ala Gly Gly Glu Ala Val Val Val Gin Gly 
50 55 60 

gat gtc acg aaa gag gaa gat gta aaa aat ate gtg caa acg gca att 240 
Asp Val Thr Lys Glu Glu Asp Val Lys Asn He Val Gin Thr Ala He 
65 70 75 80 

aag gag ttc ggc aca etc gat att atg att aat aat gec ggt ctt qaa 2 88 

Lys Glu Phe Gly Thr Leu Asp He Met He Asn. Asn Ala Gly Leu Glu 
85 90 95 

aat cct gtg cca tct cac gaa atg ccg etc aag gat tgg gat aaa gtc 336 
Asn' Pro Val Pro Ser His Glu Met Pro Leu Lys Asp Trp Asp Lys Val 
100 105 " no 

ate ggc acg aac tta acg ggt gec ttt tta gga age cgt gaa gcg att 3 84 

He Gly Thr Asn Leu Thr Gly Ala Phe Leu Gly Ser Arg Glu Ala lie 
H5 120 125 

aaa tat ttc gta gaa aac gat ate aag gga aat gtc att aac atg tec ■ 432 

Lys Tyr Phe Val Glu Asn Asp He Lys Gly Asn Val He Asn Met Ser 
130 135 140 

agt gtg cac gaa gtg att cct tgg ccg tta ttt gtc cac tat gcg gca 480 
Ser Val His Glu Val lie Pro Trp Pro Leu Phe Val Kis Tyr Ala Ala 
145 150 155 



agt aaa ggc ggg atg aag ctg atg aca gaa aca tta gcg ttg gaa tac 528 
Ser Lys Gly Gly Met Lys Leu Met Thr Glu Thr Leu Ala Leu Glu Tyr 
165 170 175 

gcg ccg aag ggc att cgc gtc aat aat att ggg cca ggt gcg ate aac 576 
Ala Pro Lys Gly lie Arg Val Asn Asn lie Gly Pro Gly Ala lie Asn 
180 185 190 
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acg cca ate aat get gaa aaa ttc get gac cct aaa cag aaa get gat 624 
Thr Pro lie Asn Ala Glu Lys Phe Ala Asp Pro Lys Gin Lys Ala Asp 
195 200 ~ 205 

gee gaa age atg att cca atg gga tat ate ggc gaa ccg gag gag ate 672 
Ala Glu Ser Met He Pro Met Gly Tyr -He Gly Glu Pro Glu Glu He 
210 215 220 

gee gca gta gca gee tgg ctt get teg aag gaa gee age tac gtc aca 720 
Ala Ala Val Ala Ala Trp Leu Ala Ser Lys Glu Ala Ser -Tyr Val Thr 
225 230 235 " 240 

ggc gtc acg tta ttc gcg gac ggc ggt atg aca eta tat cct tea ttc 758 
Gly Val Thr Leu Phe Ala Asp Gly Gly Met Thr Leu Tyr Pro Ser Phe 
245 250 * 255 

cag gca ggc cge ggt taa tga 789 
Gin Ala Gly Arg Gly * * 
260 



<210> 64 
<211> 261 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> GDH 2331 
<400> 64 

Met Tyr Pro Asp Leu Lys Gly Lys Val Val Ala He Thr Gly Ala Ala 

1 5 10 is 

Ser Gly' Leu Gly Lys Ala Met Ala He Arg Phe Gly Lys Glu Gin Ala 

2 0 25 3 0 

Lys. Val Val He Asn Tyr Tyr Ser Asn Lys Gin Asp Pro Asn Glu Val 

35 40 ~ " 45 

Lys Glu Glu Val He Lys Ala Gly Gly Glu Ala Val Val Val Gin Gly 

50 55 ' 60 

Asp Val Thr Lys Glu Glu Asp Val Lys Asn He Val Gin Thr Ala He 
65 70 75 80 

Lys Glu Phe Gly Thr Leu Asp He Met He Asn Asn Ala Gly Leu Glu 

85 90 ~ 95 

Asn Pro Val Pro Ser His Glu Met Pro Leu Lys Asp Trp Asp Lys Val 

100 105 " " 110 

He Gly Thr Asn Leu Thr Gly Ala Phe Leu Gly Ser Arg Glu Ala lie 

115 120 125 

Lys Tyr Phe Val Glu Asn Asp He Lys Gly Asn Val He Asn Met Ser 

130 135 140 

Ser Val His Glu Val lie Pro Trp Pro Leu Phe Val His Tyr Ala Ala 
145 150 155 ' 160 

Ser Lys Gly Gly Met Lys Leu Met Thr Glu Thr Leu Ala Leu Glu Tyr 

165 170 175 

Ala Pro Lys Gly He Arg Val Asn Asn He Gly Pro Gly Ala He Asn 

180 185 " 190 

Thr Pro He Asn Ala Glu Lys Phe Ala Asp Pro Lys Gin Lys Ala Asp 

195 200 ' 205 

Ala Glu Ser Met He Pro Met Gly Tyr lie Gly Glu Pro Glu Glu lie 
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210 215 220 

Ala Ala Val Ala Ala Trp Leu Ala Ser Lys Glu Ala Ser Tyr Val Thr 
225 230 ' 235 ~ 240 

Gly Val Thr Leu Phe Ala Asp Gly Gly Met Thr Leu Tyr Pro Ser Phe 

245 250 * 255 

Gin Ala Gly Arg Gly 
260 



<210> 65 

<211> 789 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> GDH 2279 

<221> CDS 

<222> (1) . . . (739) 



<400> 65 

atg tat ccg gat tta aaa gga aaa gtc gtc get att aca gga get get 48 

Met Tyr Pro Asp Leu Lys Gly Lys Val Val Ala lie Thr Gly Ala Ala 
15 10 15 

tea ggg etc, gga aag gcg atg gee att cgc ttc ggc aag gag cag gca 96 

Ser Gly Leu' Gly Lys Ala Met Ala lie Arg Phe Gly Lys Glu Gin Ala 
20 25 30 



aaa gtg gtt ate aac tat tat agt aat aaa caa gat ccg aac gag gta 144 

Lys Val Val lie Asn Tyr Tyr Ser Asn Lys Gin Asp Pro Asn Glu Val 
35 40 45 

aaa gaa 'gag gtc ate aag gcg ggc ggt gaa get gtt gtc gtc caa gga 192 

Lys Glu Glu Val lie Lys Ala Gly Gly Glu Ala Val Val Val Gin Gly 
• 50 55 60 

gat gtc acg aaa gag gaa gat gta aaa aat ate gtg caa acg gca att 240 

Asp Val Thr Lys Glu Glu Asp Val Lys Asn lis Val Gin Thr Ala lie 
65 70 75 80 

aag gag ttc ggc aca etc gat att atg att aat aat gec ggt ctt gaa 238 

Lys Glu Phe Gly Thr Leu Asp lie Met lie Asn Asn Ala Gly Leu Glu 
85 90 95 



aat cct gtg cca tct cac gaa atg ccg etc aag gat tgg gat aaa gtc 336 
Asn Pro Val Pro Ser His Glu Met Pro Leu Lys Asp Trp Asp Lys Val 
100 105 HO 

ate ggc acg aac tta acg ggt gee ttt tta gga age cgt gaa gcg att 384 
lie Gly Thr Asn Leu Thr Gly Ala Phe Leu Gly Ser Arg Glu Ala lie 
115 120 125 



aaa tat ttc gta gaa aac gat ate aag gga aat gtc att aac atg tec 432 

Lys Tyr Phe Val Glu Asn Asp He Lys Gly Asn Val He Asn Met Ser 
130 135 140 

agt gtg cac gaa gtg att cct tgg ccg tta ttt gtc cac tat gcg gca 480 
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Ser Val His Glu Val lie Pro Trp Pro Leu Phe Val His Tyr Ala Ala 
145 150 155 160 

agt aaa ggc ggg atg aag ctg atg aca gaa aca tta gcg ttg gaa tac 528 
Ser Lys Gly Gly Met Lys Leu Met Thr Glu Thr Leu Ala Leu Glu Tyr 
.165 170 175 

gcg ccg aag ggc att cgc gtc aat aat att ggg cca ggt gcg ate aac 576 
Ala Pro Lys Gly He Arg Val Asn Asn He Gly Pro Gly Ala He Asn 
180 185 .190 

acg cca ate aat get gaa" aaa ttc get gac cct aaa cag aaa get gat 624 
Thr Pro He Asn Ala Glu Lys Phe Ala Asp Pro Lys Gin Lys Ala Asp 
195 2 00 2 05 

gee gaa age atg att cca atg gga tat ate ggc gaa ccg gag gag ate 672 
Ala Glu Ser Met He Pro Met Gly Tyr He Gly Glu Pro Glu Glu He 
210 215 220 

gec gca gta gca gee tgg ctt get teg aag gaa gee age tac gtc aca 720 
Ala Ala Val Ala Ala Trp Leu Ala Ser Lys Glu Ala Ser Tyr Val Thr 
225 230 235 240 

ggc gtc acg tta ttc gcg gac ggc ggt atg aca caa tat cct tea ttc 768 
Gly Val Thr Leu Phe Ala Asp Gly Gly Met Thr Gin Tyr Pro Ser Phe 
245 250 " 255 



cag gca ggc cgc ggt taa tga 
Gin Ala Gly Arg Gly * * 
260 



789 



<210> 65' 
<211> 261 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> GDH 2279 



<4G0> 66 



Met 


Tyr 


Pro 


Asp 


Leu 


Lys 


Gly Lys 


Val Val Ala lie Thr Gly Ala Ala 


1 








5 






10 15 


Ser 


Gly 


Leu 


Gly 


Lys 


Ala 


Met Ala 


lie Arg Phe Gly Lys Glu Gin Ala 








20 








25 30 


Lys 


Val 


Val 


lie 


Asn 


Xyr 


Tyr Ser 


Asn Lys Gin Asp Pro Asn Glu Val 






35 








40 


45 


Lys 


Glu 


Glu 


Val 


He 


Lys 


Ala Gly 


Gly Glu Ala Val Val Val Gin Gly 




50 










55 


60 


Asp 


Val 


Thr 


Lys 


Glu 


Glu 


Asp Val 


Lys Asn He Val Gin Thr Ala lie 


65 










70 




75 80 


Lys 


Glu 


Phe 


Gly 


Thr 


Leu 


Asp lie 


Met lie Asn Asn Ala Gly Leu Glu 










85 






90 95 


Asn 


Pro 


Val 


Pro 


Ser 


His 


Glu Met 


Pro Leu Lys Asp Trp Asp Lys Val 








100 








105 ' 110 • 


lie 


Gly 


Thr 


Asn 


Leu 


Thr 


Gly Ala 


Phe Leu Gly Ser. Arg Glu Ala lie 






115 








120 


125 ' 
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Lys Tyr Phe Val Glu Asn Asp lie Lys Gly Asn Val lie Asn Met Ser 

130 135 14Q 

Ser Val His Glu Val lie Pro Trp Pro Leu Phe Val His Tyr Ala Ala 

1 150 155 160 

Ser Lys Gly Gly Met Lys Leu Met Thr Glu Thr Leu Ala Leu Glu Tyr 

165 170 175 

Ala Pro Lys Gly He Arg Val Asn Asn lie Gly Pro Gly Ala lie Asn 

ISO 185 i9 0 

Thr Pro He Asn Ala Glu Lys Phe Ala Asp Pro Lys Gin Lys Ala Aso 

195 200 205 ~ 

Ala Glu Ser Met He Pro Met Gly Tyr He Gly Glu Pro Glu Glu He' 

210 215 220 

Ala Ala Val Ala Ala Trp Leu Ala Ser Lys Glu Ala Ser Tyr Val Thr 
225 230 235 o 40 

Gly Val Thr Leu Phe Ala Asp Gly Gly Met Thr Gin Tyr Pro Ser Phe 

245 250 255 

Gin Ala Gly Arg Gly 
260 

<210=> -67 
<211> 789 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> GDH 2379 

<221> CDS 

<222> (1) . . . (789) 

<400> 67 

atg tat peg gat tta aaa gga aaa gtc gtc get att aca gga get act 
Met Tyr Pro Asp Leu Lys Gly Lys Val Val Ala He Thr Gly Ala Ala 
1 5 io 1S 

tea ggg etc gga aag gcg atg gee att ego ttc ggc aag gag caggca 
Ser Gly Leu Gly Lys Ala Met Ala He Arg Phe Gly Lys Glu Gin Ala 
20 25 30 



48 



96 



aaa gtg gtt ate aac tat tat agt aat aaa caa gat ccg aac gag gta 144 
Lys Val Val He Asn Tyr Tyr Ser Asn Lys Gin Asp Pro Asn Glu Val 
35 40 45 

aaa gaa gag gtc ate aag gcg ggc ggt gaa get gtt gtc gtc caa gga 192 
Lys Glu Glu Val He Lys Ala Gly Gly Glu Ala Val Val Val Gin Glv 
SO 55 60 Y 

gat gtc acg aaa gag gaa gat gta aaa aat ate gtg caa acg gca att 2*0 
Asp Val Thr Lys Glu Glu Asp Val Lys Asn He Val Gin T hr Ala He 
65 70 75 80 

aag gag ttc ggc aca etc gat att atg att aat aat gee ggt ctt gaa ?33 
Lys Glu Phe Gly Thr Leu Asp lie Met He Asn Asn Ala Gly Leu Glu 
85 90 95 

aat cct gtg cea tct cac gaa atg ccg etc aag gat tag. gat aaa gtc 336 
Asn Pro Val Pro Ser His Glu Met Pro Leu Lys Asp Trp Asp Lys Val 
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100 105 no 

ate ggc acg aac tta acg ggt gec ttt tta gga age cgt gaa gcg att 3 34 

He Gly Thr Asn Leu Thr Gly- Ala Phe Leu Gly Ser Arg Glu Ala He 
115 120 125 

aaa tat ttc gta gaa aac gat ate aag gga aat gtc att aac atg tec 432 
Lys Tyr Phe Val Glu Asn Asp He Lys Gly Asn Val He Asn Met Ser 
130 135 140 

agt gtg cac gaa gtg att cct tgg ccg tta ttt gtc cac tat gcg gca 4 80 

Ser Val His Glu Val He Pro Trp Pro Leu Phe Val His Tyr Ala Ala 
145 150 155 160 

agt aaa ggc ggg ctt aag ctg atg aca gaa aca tta gcg ttg gaa tac 528 
Ser Lys Gly Gly Leu Lys Leu Met Thr Glu Thr Leu Ala Leu Glu Tyr 
165 .170 175- 

gcg ccg aag ggc att cgc gtc aat aat att ggg cca ggt gcg ate aac 5 76 

Ala Pro Lys Gly He Arg Val Asn Asn He Gly Pro Gly Ala He Asn 
180 185 - 190 

acg cca ate aat get gaa aaa ttc gct'gac cct aaa cag aaa get gat 624 
Thr Pro He Asn Ala Glu Lys Phe Ala Asp Pro Lys Gin Lys Ala Asp 
195 200 ^ 205 

gta gaa age atg att cca atg gga tat ate ggc gaa ccg gag gag ate 672 
Val Glu Ser Met He Pro Met Gly Tyr He Gly Glu Pro Glu Glu lie 
210 215 ^ " " 220 

gee gca gta gca gee tgg ctt get teg aag gaa gee age tac gtc aca 720 
Ala Ala Val Ala Ala Trp Leu Ala Ser Lys Glu Ala Ser Tyr Val Thr 
225 230 ~ 235 240 

ggc. ate acg tta ttc gcg gac ggc ggt atg aca caa tat cct tea ttc 768 
Gly He Thr Leu Phe Ala Asp Gly Gly Met Thr Gin Tyr Pro Ser Phe 
245 250 255 

cag gca ggc cgc ggt taa tga 7 89 

Gin Ala Gly Arg Gly * * 
260 



<210> 68 
<211> 261 
<212>' PRT 

<213> Artificial Sequence 
<220> 

<223> GDH 2379 
<400> 68 

Met Tyr Pro Asp Leu Lys Gly Lys Val Val Ala He Thr Gly Ala Ala 

15 10 15 

Ser Gly Leu Gly Lys Ala Met Ala He Arg Phe Gly Lys Glu Gin Ala 

20 25 ^ 30 

Lys Val Val He Asn Tyr Tyr Ser Asn Lys Gin Asp Pro Asn Glu. Val 
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35 40 45 

Lys Glu Glu Val lie Lys Ala Gly Gly Glu Ala Val Val Val Gin Gly 

50 55 60 

Asp Val Thr Lys Glu Glu Asp Val Lys Asn He Val Gin Thr Ala He 
65 70 75 so 

Lys Glu Phe Gly Thr Leu Asp He Met lie Asn Asn Ala Gly Leu Glu 

85 90 95 

Asn Pro Val Pro Ser His Glu Met Pro Leu Lys Asp Trp Asp Lys Val 

100 105 ~ 110 

He Gly Thr Asn Leu Thr Gly Ala Phe Leu Glv Ser Arg Glu Ala He 

115 120 12 5 

Lys Tyr Phe Val Glu Asn Asp He Lys Gly Asn Val He Asn Met Ser 

130 135 i4 0 

Ser Val His Glu Val He Pro Trp Pro Leu Phe Val His Tyr Ala Ala 
145 15 ° 155 i 60 

Ser Lys Gly Gly Leu Lys Leu Met Thr Glu Thr Leu Ala -Leu Glu Tyr 

1*55 170 175 

Ala Pro Lys Gly He Arg Val Asn Asn He Gly Pro Glv Ala He Asn 

180 185 * 190 

Thr . Pro He Asn Ala Glu Lys Phe Ala Asp Pro Lys Gin Lys Ala Asp 
195 200 
-Val Glu "Ser Met He Pro Met Gly Tyr He Gly Glu Pro Glu Glu He 

210 215 220 

Ala Ala Val Ala Ala Trp Leu Ala Ser Lys Glu Ala Ser Tyr Val Thr 
225 230 235 240 

Gly He Thr Leu Phe Ala Asp Gly Gly Met Thr Gin Tyr Pro Ser Phe 

245 250 ' 255 

Gin Ala Gly Arg Gly 
260 

<210> 69 
<211> 1206 
<212 > * DNA 

<213> Artificial Sequence 
<220> 

<223> FDH FDHPs3 

<221>- CDS 

<222> (1) . . . (1206) 

<4Q0> 69 

atg gca aaa gtt eta tgt gtt eta tat gat gat ccg gtt gat got tat 48 

Met Ala Lys Val Leu Cys Val Leu Tyr Asp Asp Pro Val Asp Gly Tyr 
15 10 15 

ccg aaa acc tat gca cgt gat gat eta ccg aaa att gat cat tat ccg Q6 
Pro Lys Thr Tyr Ala Arg Asp Asp Leu Pro Lys He Asp His Tyr Pro 
20 25 ~ 30 

ggt ggt cag acc eta ccg acc ccg aaa gca att gat ttt acc ccg ggt 144 
Gly Gly Gin Thr Leu Pro Thr Pro Lys Ala He Asp Phe Thr Pro Gly 
35 40 45 

cag eta eta ggt age gtt age ggt gaa eta ggt eta cqt aaa tat eta iq 2 
Gin Leu Leu Gly Ser Val Ser Gly Glu Leu Gly Leu Arg Lys Tyr Leu 
50 55 60 
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gaa age aac ggt cat ace eta gtt gtt acc age gat aag gac ggc cct 240 
Glu Ser Asn Gly His Thr Leu Val Val Thr Ser Asp Lys Asp Gly Pro 
65 70 75 80 

gac age gtg ttc gag cgc gag eta gtg gac gee gac gtg gtg att age 2 83 

Asp Ser Val Phe Glu Arg Glu Leu Val Asp Ala Asp Val Val tie Ser 
85 90 " 95 

cag cct ttc tgg cct gec tat eta acc cct- gag cgc att .gee aag gee 336 
Gin Pro Phe Trp Pro Ala Tyr Leu Thr Pro Glu Arg lie Ala Lys Ala 
100 105 ~ 110 

aag aat eta aag eta gee eta acc gec ggc att ggc age gac cat gtg 3 84 

Lys Asn Leu Lys Leu Ala Leu Thr Ala Gly lie Gly Ser Asp His Val 
115 • 120 125 

gac eta cag age gec att gac cgc aat gtg acc gtg gee gag gtg acc '432 
Asp Leu Gin Ser Ala lie Asp Arg Asn Val Thr Val Ala Glu Val Thr ' 
130 135 140 

tat tgt aat age att age gtg gec gag cat gtg gtg atg atg att eta 480 
Tyr Cys Asn Ser -He Ser Val Ala Glu His Val Val Met Met He lieu 
145 150 155 160. 

age eta gtg cgc aat tat eta cct tec cat gaa tgg gcg cgt aaa ggc 528 
Ser Leu Val Arg Asn Tyr Leu Pro Ser His Glu Trp Ala Arg Lys Gly 
165 170 - 175 

ggc tgg aac ate gcg gat tgc gtc tec cat gcg tat gat ctg gaa gcg 576 
Gly Trp Asn He Ala Asp Cys Val Ser His Ala Tyr Asp Leu Glu Ala 
" 180 185 190 

atg cat gtc ggc acg gtc gcg gcg ggc cgt ate gec ctg gcg gtc ctg 624 
Met. His Val Gly Thr Val Ala Ala Gly Arg He Ala Leu Ala Val Leu 
195 200 205 

cgt cgt ctg gcg ccg ttt gat gtc cat ctg cat tat acg gat cgt cat 672 
Arg Arg Leu Ala Pro Phe Asp Val His Leu His Tyr Thr Asp Arg His 
210 215 220 

cgt ctg ccg gaa teg gta gaa aaa gaa tta aac tta acg tgg cat gcg 720 
Arg Leu Pro Glu Ser Val Glu Lys Glu Leu Asn Leu Thr Trp His Ala 
225 230 235 *" 240 

acg agg gaa gat atg tac cca gta tgt gat gta gta acg tta aac tgt 768 
Thr Arg Glu Asp Met Tyr Pro Val Cys Asp Val Val Thr Leu Asn Cys 
245 " 250 255 

cca tta cat cca gaa acg gaa cat atg att aac gat gaa acg tta aaa 816 
Pro Leu His Pro Glu Thr Glu His Met He Asn Asp Glu Thr Leu Lys 
260 265 270 

tta ttc aaa agg gga gcg tac att gtc aac acg gcg aga ggc aaa ttg 864 
Leu Phe Lys Arg Gly Ala Tyr He Val Asn Thr Ala Arg Gly Lys Leu 
275 280 285 
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tgc gat aga gat gcg gtc gcg aga gcg ttg gaa tea ggc aga ttg qca 91? 
Cys Asp Arg Asp Ala Val Ala Arg Ala Leu Glu Ser Gly Arg Leu Ala 
290 295 3 00 

ggc tat gcg ggc gat gtc tgg ttt ccg caa ccg gcg ccg aaa gat cat 960 
Gly Tyr Ala Gly Asp Val Trp Phe Pro Gin Pro Ala Pro Lys Asp His 
305 310 315 " 320 

ccg tgg aga a eg atg ccg tat aac ggc atg acg ccg cat att tea ggc 1003 
Pro Trp Arg Thr Met Pro Tyr Asn Gly Met Thr Pro Kis He Ser Gly 
325 330 335 

acg acg ttg acg gcg caa gcg aga tat get gcg ggc acg aga gaa att 1056 
Thr Thr Leu Thr Ala Gin Ala Arg Tyr Ala Ala Gly Thr Arg Glu He 
340 . 345 350 

ttg gaa tgc ttt ttt gaa ggc aga cca ate cgt gac gaa tat ctg ate 1104 
Leu Glu Cys Phe Phe Glu Gly Arg Pro He Arg Asp Glu Tyr Leu He 
355 360 365 

gtc cag ggt ggt gee ctg gec ggt acc ggt gee cat tct tat tct aaa 1152 
Val Gin Gly Gly Ala Leu Ala -Gly Thr Gly Ala His Ser Tyr Ser Lys 
370 375 380 

ggt aat gee acc ggt ggt tct gaa gaa gee aaa ttc aaa aaa gee gtc 1200 
Gly Asn Ala Thr Gly Gly Ser Glu Glu Ala Lys Phe Lys Lys Ala Val 
385 390 . 395 * 400 



<210> 70 
<211> 400 
<212> PRT 

<213> Pseudomonas sp. strain 101 

<400> 70 
Me 
1 



65 



Ala 


Lys 


Val 


Leu 


Cys 


Val 


Leu 


Tyr Asp Asp Pro Val Asp Glv 


Tyr 


Lys 


Thr 




5 








10 15* 


Tyr 


Ala 


Arg 


Asp 


Asp 


Leu Pro Lys He Asp His Tyr 


Pro 


Gly 




20 










25 30 




Gin 


Thr 


Leu 


Pro 


Thr 


Pro 


Lys Ala He Asp Phe Thr Pro 


Gly 




35 










40 


45 


Leu 


Leu 


Gly 


Ser 


Val 


Ser 


Gly 


Glu Leu Gly Leu Arg Lys Tyr 


Leu 


50 










55 




60 




Ser 


Asn 


Gly 


His 


Thr 


Leu 


Val 


Val Thr Ser Asp Lys Asp Gly Pro 










70 






75 


80 


Ser 


Val 


Phe 


Glu 
85 


Arg 


Glu 


Leu 


Val Asp Ala Asd Val Val lie 
90 95 


Ser 


Pro 


Phe 


Trp 
100 


Fro 


Ala 


Tyr 


Leu 


Thr Pro Glu Arg He Ala Lys 
105 no 


Ala 


Asn 


Leu 
115 


Lys 


Leu 


Ala 


Leu 


Thr 
120 


Ala Gly lie Gly Ser Asp His 
125 


Val 


Leu 


Gin 


Ser 


Ala 


He 


Asp 


Arg 


Asn Val Thr Val Ala Glu Val 


Thr 


13 0 










135 




140 
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Tyr 


Cys 


Asn Ser 


He 


Ser 


Val 


Ala Glu His Val 


Val Met Met He Leu 


145 








150 




155 


160 


Ser 


Leu 


Val Arg 


Asn 


Tyr 


Leu 


Pro Ser His Glu 


Trp Ala Arg Lys Gly 








1(55 






170 


175 


Gly 


Trp 


Asn lie 


Ala 


Asp 


Cys 


Val Ser His Ala 


Tyr Asp Leu Glu Ala 






180 








185 


190 


Met 


His 


Val Gly 


Thr 


Val 


Ala 


Ala Gly Arg He 


Ala Leu Ala Val Leu 






195 








200 


205 


Arg 


Arg 


Leu Ala 


Pro 


Phe 


Asp 


Val His Leu His 


Tyr Thr Asp Arg His 




210 








215 




220 


Arq 


Leu 


Pro Glu 


Ser 


Val 


Glu 


Lys Glu Leu Asn 


Leu Thr Trp His Ala 


225 








23 0 




235 


24 0 


Thr 


Arq 


Glu -Asp Met 


Tvr 


Pro 


Val Cys Asp Val 


Val Thr Leu Asn Cys 








245 






250 


255 


Pro 


Leu 


His Pro 


Glu 


Thr 


Glu 


His Met He Asn 


Asp Glu Thr Leu Lys 






26o' 








265 


270 


Leu 


Phe 


Lys Arg Gly 


Ala 


Tyr 


lie Val Asn Thr 


Ala Arg Gly Lys Leu 






275 








280 


j~ O 3 


Cvs 


Asp 


Arg Asp 


Ala 


Val 


Ala 


Arg' Ala Leu Glu 


Ser Glv Arcr Leu Ala 




2 90 








2 95 




ion 


Glv 


Tvr 


Ala Gly 


Asp 


Val 


Trn 


Phe Pro Gin Pro 


Ala. P3"n Lv 1 ? A^*d Hie; 


J U -J 








—I -L U 






_* -t- u 


Pro 


Trp 


Arg Thr 


Met 


Pro 


Tyr 


Asn Gly Met Thr 


Pro His He Ser Gly 








325 






330 


335 


Thr 


Thr 


Leu Thr 


Ala 


Gin 


Ala 


Arg Tyr Ala Ala 


Gly Thr Arg Glu He 






340 








345 


350 


Leu 


Glu 


Cys Phe 


Phe 


Glu 


Gly 


Arg Pro He Arg 


Asp Glu Tyr Leu He 






355 








360 


365 


Val 


Gin 


Gly Gly Ala 


Leu 


Ala 


Gly Thr Gly Ala 


His Ser Tyr Ser Lys 




370 








375 




380 


Gly 


Asn 


Ala Thr Gly 


Gly 


Ser 


Glu Glu Ala Lys 


Phe Lys Lys Ala Val 


385 








3 90 




395 


400 



<210> 71 
<211> 1098 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> FDH FDHCbl3 

<221> CDS 

<222> (1) . . . (1098) 

<400> 71 

atg aaa ate gta etc gta etc tac gat gca ggc aaa cac gca gca gat 48 

Met Lys He Val Leu Val Leu Tyr Asp Ala Gly Lys His Ala Ala Asp 
1 5 10 15 

gaa gaa aaa etc tac ggc tgc acg gaa aat aag ctg ggc att gca aat 96 
Glu Glu Lys Leu Tyr Gly Cys Thr Glu Asn Lys Leu Gly He Ala Asn 
20 * 25 30 

tgg ctg aag gat cag ggc cac gaa ctg att acg acg tea gat aag gaa 144 
Trp Leu Lys Asp Gin Gly His Glu Leu He Thr Thr Ser Asp Lys Glu 
35 40 45 
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ggc ggt aat tec gtc ttg gat caa cac ate ccc gat get gat ate ate 192 
Gly Gly Asn Ser Val Leu Asp Gin His lis Pro Asp Ala Asp lie lie 
50 55 60 

ate aca aca ccc ttc cac ccc get tae ate aca aaa gaa aga ate gat 240 
lie Thr Thr Pro Phe His Pro Ala Tyr He Thr Lys Glu Arg He Asp 
65 ' 70 75 80 

aaa get aaa aaa ttg aaa ttg gtc gtc gtc get ggt gtc ggt tec gat 288 
Lys Ala Lys Lys Leu Lys Leu Val Val Val Ala Gly Val Gly Ser Asp 
85 90 95 

cac ate gat ttg gat tac ate aat caa aca ggt aaa aaa .ate tec gtc 336 
His He Asp Leu Asp Tyr He Asn Gin Thr Gly Lys Lys He Ser Val 
100 " 105 110 



ttg gaa gtc aca ggt tec aat gtc gtc tee gtc get gaa cac gtc gtc 
Leu Glu Val Thr Gly Ser Asn Val Val Ser Val Ala Glu His Val Val 
115 " 120 125 



gat ate gaa ggt aaa aca ate get aca ate ggt get ggt aga ate ggt 
Asp He Glu Gly Lys Thr He Ala Thr He Gly Ala Gly Arg He Gly 
165 170 .175 



gtt ace gtt aac get ccg ttg cac get ggt ace aaa ggt ttg ata aac 
Val Thr Val Asn Ala Pro Leu His Ala Gly Thr Lys Gly Leu He Asn 
225 230 235 240 



get cgt ggt get ata tgc gtt get gaa gat gtt get get get ttg gaa 
Ala Arg Gly Ala He Cys Val Ala Glu Asp Val Ala Ala Ala Leu Glu 
260 • 265 270 



384 



atg aca atg ttg gtc ttg gtc .aga aat ttc gtc ccc get cac gaa caa 432 
Met Thr Met Leu Val Leu Val Arg Asn Phe Val Pro Ala His Glu Gin 
130 135 140 

ate ate aat cac gat tgg gaa gtc get get ate get aaa gat get tac 480 
He lie Asn His Asp Trp Glu Val Ala Ala He Ala Lys Asp Ala Tyr 
145 150 155 160 



528 



tac aga' gtc ttg gaa aga ttg gtc ccc ttc aat ccc aaa gaa ttg ttg 576 
Tyr Arg Val Leu Glu Arg Leu Val Pro Phe Asn Pro Lys Glu Leu Leu 
180 ~ 185 190 

tac tac gat tac caa get ttg ccc aaa gat get gaa gaa -aaa gtt ggt 624 
Tyr Tyr Asp Tyr Gin Ala Leu Pro Lys Asp Ala Glu Glu Lys Val Gly 
195 200 205 

get cgt cgt gtt gaa aac ata gaa gaa ttg gtt get cag get gat ata 672 
Ala Arg Arg Val Glu Asn He Glu Glu Leu Val Ala Gin Ala Asp lie 
210 215 220 



720 



aaa gaa ttg ttg tea aaa ttt aaa aaa ggt get tgg ttg ctt aac ace 768 
Lys Glu -Leu Leu Ser Lys Phe Lys Lys Gly Ala Trp Leu Leu Asn Thr 
245 250 255 



816 



tea ggt cag ttg cgt ggt tac ggt ggt gat gtt tgg ttt ccg cag ccg 864 



76 
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Ser Gly Gin Leu Arg Gly Tyr Gly Gly 
275 280 

get ccg aaa gat cac ccg tgg cgt gat 
Ala Pro Lys Asp His Pro Trp Arg Asp 
290 295 

ggt aac get atg acc ccg cac tac tea 
Gly Asn Ala Met Thr Pro His Tyr Ser 
305 310 • 

acc cgt tac get cag ggt acc aaa aac 
Thr Arg Tyr Ala Gin Gly Thr Lys Asn 
325 

ggt aaa ttt gat tat cgt cca cag gat 
Gly Lys Phe Asp Tyr Arg Pro Gin Asp 
340 345 

tat gtt acc aaa gee tat ggt aaa cac 
Tyr Val Thr Lys Ala Tyr Gly Lys His 
355 " ~ 360 



Asp Val Trp Phe Pro Gin Pro 
285 

atg cgt aac aaa tac ggt get 912 
Met Arg Asn Lys Tyr Gly Ala 
300 

ggt acc acc ttg gat get cag 960 
Gly Thr Thr Leu Asp Ala Gin 
315 320 

ate etc gaa teg. ttt ttt acc 1008 
He Leu Glu Ser Phe Phe Thr 
330 335 

ate ate etc etc aac ggt gaa 1056 
He He Leu Leu Asn Gly Glu 
350 

gat aaa aaa taa tga 1098 
Asp Lys Lys * * 



<210> 72 
<211> 364 
<212> PRT 

<213> Candida boidinii 



<40G> 72 



Met 


Lys 


He 


Val 


Leu 


Val Leu Tyr Asp 


Ala 


Gly Lys His Ala Ala Asp 


1 








5 




10 




15 


Glu 


Glu 


Lys 


Leu 


Tyr 


Gly Cys Thr Glu 


Asn 


Lys 


Leu Gly lie Ala Asn 








20 




25 






30 


Trp 


Leu 


Lys 


Asp 


Gin 


Gly His Glu Leu 


lie 


Thr 


Thr Ser Asp Lys Glu 






35 






40 






45 


Gly Gly 


Asn 


Ser 


Val 


Leu Asp Gin His 


lie 


Pro 


Asp Ala Asp lie lie 




50 








55 






60 


lie 


Thr 


Thr 


Pro 


Phe 


His Pro Ala Tyr 


lie 


Thr 


Lys Glu Arg lie Asp 


65 










70 




75 


80 


Lys Ala 


Lys 


Lys 


Leu 


Lys Leu Val Val 


Val 


Ala 


Gly Val Gly Ser Asp 










85 




90 




95 


His 


He 


Asp 


Leu 


Asp 


Tyr lie Asn Gin 


Thr Gly Lys Lys lie Ser Val 








100 




105 






110 


Leu 


Glu 


Val 


Thr 


Gly 


Ser Asn Val Val 


Ser 


Val 


Ala Glu His Val Val 






115 






120 






125 


Met 


Thr 


Met 


Leu 


Val 


Leu Val Arg Asn Phe Val 


Pro Ala His Glu Gin 




130 








135 • 






140 


He 


lie 


Asn 


His 


Asp 


Trp Glu Val Ala Ala 


lie 


Ala Lys Asp Ala Tyr 


145 










150 




155 


160 


Asp 


lie 


Glu 


Gly 


Lys 


Thr lie Ala Thr 


lie 


Gly Ala Gly Arg He Gly 










165 




170 




175 


Tyr Arg 


Val 


Leu 


Glu 


Arg Leu Val Pro 


Phe Asn 


Pro Lys Glu Leu Leu 








180 




185 






190 


Xyr 


Tyr 


Asp 


Tyr 


Gin 


Ala Leu Pro Lys 


Asp 


Ala 


Glu Glu Lys Val Gly 






195 






200 






205 


Ala 


Arg 


Arg 


Val 


Glu 


Asn He Glu Glu 


Leu 


Val 


Ala Gin Ala Asp lie 
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210 215 2 ?0 

Val Thr Val Asn Ala Pro Leu His Ala Gly Thr Lys Gly Leu lie Asn 
**" 2 3 0 23 5 

Lys Glu Leu Leu Ser Lys Phe Lys Lys Gly Ala Trp Leu Leu Asn inr 

Ala Arg Gly Ala lie Cys Val Ala Glu Asp Val Ala Ala Ala Leu Glu 

260 255 270 

Ser Gly Gin Leu Arg Gly Tyr Gly Gly Asp Val Trp Phe Pro Gin Pro 

275 280 ., 35 

Ala Pro Lys Asp His Pro Trp Arg Asp Met Arg Asn Lys Tyr Gly Ala 

Gly Asn Ala Met Thr Pro His Tyr Ser Gly Thr Thr Leu Asp Ala Gin 

3X0 3X5 
Thr Arg Tyr Ala Gin Gly Thr Lys Asn He Leu Glu Ser Phe Phe Tnr 

Gly Lys Phe Asp Tyr Arg Pro Gin Asp lie lie Leu Leu Asn Gly Glu 
340 345 350 

. Tyr Val Thr Lys Ala Tyr Gly Lys His Asp Lys Lys 
355 360 

<210> 73 
<211> 765 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223=- HHDH P016514-B-12 

<221> CDS 

<222> (1) . . . (765) 

<400> 73. 

atg age ace get ate gtc acc aac gtc aaa cat ttt gga gqt atq act 

Met Ser Thr Ala lie Val Thr Asn Val Lys His Phe fly fly Met lly 
1 5 . 10 15 

age get ctg agg ctg age gaa get ggt cat acc gtc get tgc cat gat 
Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala c?s His lip 
20 25 30 

gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa acc tac 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe La Glu Thr Tyr 
35 40 45 * 

cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg ate gaa 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu 11= Su 
50 " 55 60 

get gtc acc age get tac ggt cag gtc gat gtc ctg gtc aac aac aat 
Ala Val Thr Ser Ala Tyr Gly Gin Val Lp Val Leu ?£ sir Asn Isp 
65 70 75 so 

ate ttt get cca gaa ttt cag cca ate gat aaa tac get gtc caq oat 
He Phe Ala Pro Glu Phe Gin Pro lie Asp Lys Tyr Ala Val Gin Asp 

8S 90 ■ 95 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get. eta gtg 

7S 



43 



96 



144 



19: 



240 



283 



336 
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Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 
100 105 no 

aat get gtg get teg caa atg aag aag cga aag teg ggg cae ate ate 3 84 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 
115 120 ~ * 125 

ttc ate act teg get get ccg ttc ggg cca tgg aag gag eta teg act * 432 
Phe He Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
13 0 13 5 14 0 

tac act teg get cga get ggg get tgt tee eta get aat. get eta teg 480 
Tyr Thr Ser Ala Arg Ala Gly Ala Cys Ser Leu Ala Asn Ala Leu Ser 
145 150 155 160 

aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccg aat 528 
Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 
165 170 A 175 

tac eta cac teg gag gat teg ccg tac ttc tac ccg act gag ccg tgg 576 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 
180 185 190 

aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 624 
Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
195 200 205 

caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt ttg 672 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
210 215 220 

gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca 720 
Ala Ser" Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 240 

ggc ggc ttt ccc atg ata gaa cgt tgg ccc ggc atg ccc gaa taa 765 
Gly Gly Phe Pro Met He Glu Arg Trp Pro Gly Met Pro Glu * 
245 250 



<210> 74 
<211> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HHDH P016514-B-12 
<400> 74 

Met Ser Thr Ala lie Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 5 10 15 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 

20 25 ' 30 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 

35 40 45 

Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 
50 55 60 
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Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 
b5 70 75 

He Phe Ala Pro Glu Phe Gin Pro lie Asp Lys Tyr Ala Val Gin Asp 

85 g 0 95 

Tyr Arg Gly Ala Val Glu Ala Leu Gin n e Arg Pro Phe Ala Leu Val 

100 105 110 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He lie 

115 120 i 25 

Phe lie Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
13 0 135 i4 0 

• Tyr Thr Ser Ala Arg Ala Gly Ala Cys Ser Leu Ala Asn Ala Leu Ser 
145 150 155 i60 

Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala lie Gly Pro Asn 

1S 5 17Q 175 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 

130 185 i go 

Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 205' 

Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

210 215 220 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 

' 2 ? 5 230 235 240 

Gly Gly Phe Pro Met He Glu Arg Trp Pro Gly Met Pro Glu 
245 250 

<210> 75 
<211> 765 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH M=1/4H6 

<221> CDS 

<222> (1) . . . (765) 

<400> 75 

atg age acc get ate gtc acc aac gtc aaa cat ttt ggt ggt atg qqt 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 
15 10 15 

age get ctg agg ctg age gaa get ggt cat acc gtc get tgc cat gat 
Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 
20 25 30 

gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa acc tac 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 40 45 

cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctq ate aaa 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 
SO 55 60 

get gtc acc age get tac ggt cag gtc gat gtc -ctg gtc age aac gat 
Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 
55 70 75 so 



48 



96 



144 



192 



240 



SO 
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ate ttt get cca gaa ttt cag cca ate gat aaa tac get gtc gaa gat 283 
He Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp 
85 90 " " 95 



tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta gtg 
Tyr Arg Gly Ala Val Glu Ala Leu Gin Il e Arg Pro Phe Ala Leu Val 
'100 105 *" no 



336 



aat get gtg get teg caa atg aag aag cga aag teg ggg cac ate ate 3 84 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His Ile Ile 
115 120 ~ ~ 125 

ttc ate act teg get act ccg ttc ggg ccg tgg aag gag eta teg act 432 
Phe Ile Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
130 135 140 

tac act teg get cga get ggg get tgt act eta get aat get eta teg 480 
Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 150 155 iso ' 

aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccg aat 528 
Lys Glu Leu Gly Glu Tyr Asn Ile Pro Val Phe Ala Ile Gly Pro Asn 
165 170 175 

tac eta cac teg gag gat teg ccg tac ttc tac ccg act gag ccg tgg 5 76 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 
180 185 " 190 

aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 624 
Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
195 200 " 205 . 

caa cga- eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt ttg 672 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
210 215 220 

gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca 720 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala ' 
225 230 ' 235 " 240 

ggc ggc ttt ccc atg ata gaa cgt tgg ccc ggc atg ccc gaa taa 765 
Gly Gly Phe Pro Met Ile Glu Arg Trp Pro Gly Met Pro Glu * 
245 250- 



<210> 76 
<211> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HHDH MZ1/4H6 
<400> 76' 

Met Ser Thr Ala Ile Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 5 10 * ' 15 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 



81 
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20 

" 5 30 



Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 

40 45 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu lie Glu 

Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 



40 

: Gly Gin Val Asp Val 
70 7S 
He Phe Ala Pro Glu Phe Gin Pro lie Asp Lys Tyr Ala Val Glu Asp 

Tyr Arg Gly Ala Val Glu Ala Leu Gin IL Arg Pro Phe ^ ^ 

00 105 110 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 

115 120 
Phe lie Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

HI Thr Ser Ala ^9 £La G1 y ^a Cys Thr Leu Ala Asn Ala Leu Ser 

Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala lie Gly Pro Asn 
1SS 170 175 ' 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Tro 

185 190 
Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
„, 10 ' 215 220 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
A ~ 230 ?35 

Gly Gly Phe Pro Met lie Glu Arg Trp Pro Gly Met Pro Glu **° 
245 250 

<210> 77 
<211> 765 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH P016229-F-04 

<221> CDS 

<222> (1) ... (765) 

<400> 77 



atg age acc get ate gtc acc aac gtc aaa cat ttt gga ggt atq qqt 
Met Ser Thr Ala lie Val Thr Asn Val Lys His Phe g!v Gly Mel lly 
1 5 10 15 

agt get ctg agg ctg teg gag get ggt cac acc gtc get tgc cat qat 
Ser Ala Leu Arc Leu Ser Glu Ala Gly His Thr Val Ala C?s His Asp 
20 25 30 

gaa age ttt aaa cag aaa gat gaa ctg gag get ttt get gaa acc tac 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe lla Gin J" iyt 
35 40 45 

cca cag ctg aaa cca atg age gaa cag gaa cca get gga eta att 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro lla Gly. Leu 111 Glu 



95 



144 
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get gtc ace age get tac ggt cag gtc gat gtc etg gtc age aac gat 
Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 75 



240 



80 



288 



ate ttt get cca gaa ttt cag cca ate gat aaa tac get gtc gaa gat 
lie Phe Ala Pro Glu Phe Gin Pro lie Asp Lys Tyr Ala Val Glu Asp 
85 90 * ' 95 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt .get eta gtg 336 
Tyr Arg Gly Ala Val Glu Ala Leu Gin lie Arq Pro Phe Ala Leu .Val 
100 105 no 

aat get gtg get teg caa atg aag aag cga aag teg ggg cac ate ate 3 84 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 
115 12 0 " 125 

ttc ate act teg get act ccg" ttc ggg cca tgg aag gag eta teg act 432 
Phe He Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
130 - 135 ^ ' 140. 

tac act teg get cga- get ggg get tgt act eta get aat get eta teg 480 
Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 150 155 160 . 

aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccg aat 528 
Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala lie Gly Pro Asn 
165 170 175 

tac eta cac teg' gag gat teg ccg tac ttc tac ccg act gag ccg tgg 57b 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 
180 185 "* 190 

aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 624 
Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
195 200 ^ ' 205 

caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt ttg 672 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
210 215 220 

gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca 720 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 240 

ggc ggc ttt ccc atg ata gaa cgt tgg ccc ggc atg ccc gaa taa 765 
Gly Gly Phe Pro Met He Glu Arg Trp Pro Gly Met Pro Glu * 
245 * 250 



<210> 78 

<211> 254 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HHDH P016229-F-04 
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<400> 78 

Met Ser Thr Ala H e Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

5 10 15 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 

20 25 30 * 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tvr 

35 40 45 . 1 

Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Gly Leu lie Glu 

Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 

_ 70 7 5 80 

He Phe Ala Pro Glu Phe Gin Pro lie Asp Lys Tvr Ala Val Glu Asp 

85 90 95 

Tyr Arg Gly Ala Val Glu Ala Leu Gin lie Arg Pro Phe Ala Leu Val 

100 105 , 110 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His lie He 

115 120 i, 5 

Phe He Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

130 135 140 

Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 150 155 iso 

Lys Glu Leu Gly Glu Tyr Asn lie Pro Val Phe Ala lie Gly Pro Asn 

165 ivo 1?5 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 

ISO 135 19Q 

Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 205 

Gin Arg Leu Gly .Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

2X0 215 2 ?0 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 

230 235 240 

Gly Gly Phe Pro Met He Glu Arg Trp Pro Gly Met Pro Glu 
245 250 

<210> 79 
<211> 755 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH P016230-A-08 

<221> CDS 

<222> (1) . . . (765) 

<400> 79 

atg age acc get ate gtc acc aac gtc aaa cat ttt gga ggt atg qqt 
Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 
5 io 



15 



age get ctg agg ctg age gaa get ggt cat acc gtc get tgc cat gat 
Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 
20 25 30 

gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa acc tac 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 



48 
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35 40 45 

cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg ate gaa 19: 

Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu lie Glu 
50 55 60 



get gtc ace age get tac ggt cag gtc gat gtc ctg gtc age aac gat 
Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 75 80 



240 



ate ttt get cca gaa ttt cag cca ate gat aaa tac get gtc gaa gat 238 
lie Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp 
85 90 J 95 

tat cgt ggt get gtc gaa get ctg cag ate agg cca ttt get eta gtg 33b 
Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 
100 105 HO 

aat get gtg get teg caa atg aag aag cga aag teg ggg cac ate ate 3 84 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 
115 120 125 

ttc ate act teg get act ccg ttc ggg cca tgg aag gag eta teg act 432 
Phe lie Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
130 135 " 140 

tac act teg get cga get ggg get tgt act eta get aat get eta teg 480 
Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 150 155 150 

aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccg aat 528 
Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 
165 170 175 

tac eta cac teg gag gat teg ccg tac ttc tac ccg act gag ccg tgg 576 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 
180 185' 190 

aag act aat ccg gaa cac gtg get cac gtg aag aag gtg act get eta 624 
Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
195 200 ' 205 

caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt ttg 672 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
210 215 220 

gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca 720 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 " 230 235 ~ 240 

ggc ggc ttt ccc atg ata gaa cgt tgg ccc ggc atg ccc gaa taa 7 65 

Gly Gly Phe Pro Met He Glu Arg Trp Pro Gly Met Pro Glu * 
245 " 250 



<210> 80 
<211> 254 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HHDH P016230-A-03 
<400> 80 

Met Ser Thr Ala lie Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His T hr Val Ala Cys His Asp 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 

35 40 4 5 

Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 

55 60 
Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 

70 75 



. lie Phe Ala Pro Glu Phe Gin Pro lie Asp Lys Tyr Ala Val Glu Asp 

35 9 ° 95 

Tyr Arg Gly Ala Val Glu Ala Leu Gin lie Arg Pro Phe Ala Leu Val 

105 • no 
Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His lie He 

Phe lie Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ma Asn Ala Leu Ser 

155 i cn 

Lys Glu Leu Gly Glu Tyr Asn lie Pro Val Phe Ala He Gly Pro Asn 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 

180 185 190 

Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 205 

Gin Arg -Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

210 215 2?0 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 

230 235 
Gly Gly Phe Pro Met He Glu Arg Trp Pro Gly Met Pro Glu **° 
245 250 

<210> 81 
<211> 765 
. <212> DNA 
<213> Artificial Sequence 

<220> 

<223> HHDH P016096-G9 

<221> CDS 

<222> (1) . . . (765) 

<400> 31 

atg age acc get ate gtc acc aac gtc aaa cat ttt gga ggt atg got 

Met Ser Thr Ala lie Val Thr Asn Val Lys His Phe Gly Ify Met Gly 

5 10 15 

age get ctg agg etg age gaa get ggt cat acc gtc get tgc cat gat 



43 



96 



S6 
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Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Aso 
20 25 30 

gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa ace tac 144 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 40 45 

cca cag ctg. aaa cca atg age gaa cag gaa cea get gaa ctg ate gaa 192 
Pro Gin Leu- Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu lie Glu 
50 55 go 

get gtc acc age get tac ggt cag gtc gat gtc ctg gtc age aac gat ■ 240 
Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val .Leu Val Ser Asn Asp 
65 '.70 75 80 

ate ttt get cea gaa ttt cag cca ate gat aaa tac get gtc gaa gat 283 
lie Phe Ala Pro Glu Phe Gin Pro lie Asp Lys Tyr Ala Val Glu Asp 
85 90 1 " 95 

tac agg ggt get gtc gaa get ctg cag ate ago cca ttt get eta gtg 336 
Tyr Arg Gly Ala Val Glu Ala Leu Gin lie Arg Pro Phe Ala Leu Val 
100 105 no 

aat get gtg get teg caa atg aag aag cga aag teg ggg cac ate ate. 384 
Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His lie lie 
115 120 ~ ' 125 

tte ate act teg get act ccg ttc ggg cca tgg aaa gag eta teg act 432 
Phe lie Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
130 135 " 140 

tac act teg get cga get ggg get tgt act eta get aat get eta teg 430 
Tyr Thr" Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 150 155 ISO 

aag gag eta gga gag tac aat ate ccg gtg tte get ate ggg ccg aat 528 
Lys Glu Leu Gly Glu Tyr Asn lie Pro Val Phe Ala lie Gly Pro Asn 
165 170 175 

tac eta cac teg gag gat teg ccg tac ttc tac ccg act gag ccg tgg 576 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 
180 185 " 190 

aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 624 
Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
195 200 " 205 

caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttc ctg 672 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
210 215 220 

gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca ' 720 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu .Ala 
225 230 * 235 240 

ggc ggc ttt ccc att ate" gaa cgt tgg ccc ggc atg ccc gaa taa 765 
Gly Gly Phe Pro lie He Glu Arg Trp Pro Gly Met Pro Glu * 
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245 250 



<210> 32 
<211> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HHDH P016095-G9 
<400> 82 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1.5 10 15 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 

20 ^ 25 30 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tvr 

35 40 45 

Pro- Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu lie Glu 

50 55 6 o 

Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 75 80 

He Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp 

85 90 " q 5 

•Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 

100 105 no 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 

115 120 ~ " 125 • 

Phe He Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

13 0 13 5 14 0 

Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 150 155 160 

Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 

165 170 175 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 

180 185 ' 19Q 

Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 205 

Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

210 215 220 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 . " 2 40 

Gly Gly ?he Pro He lie Glu Arg Trp Pro Gly Met Pro Glu 
245 * 250 



<210> 83 
<211> 765 
"<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH P016097-F9 

<221> CDS 

<222> (1) . . . (765) 

<400> 83 
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atg acc acc get ate gtc acc aac gtc aaa cat ttt gga ggt atg ggt 48 
Met Thr Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 
15 10 * 15 

age get ctg agg ctg age gaa get ggt cat acc gtc get tgc cat gat 96 
Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 
20 25 30 

gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa acc tac 144 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 40 45 

cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg ate gaa 192 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 
50 55 60 

get gtc acc age get tac ggt cag gtc gat a'tc ctg gtc age aac gat 240 
Ala Val Thr Ser Ala Tyr Gly Gin Val Asp He Leu Val Ser Asn Asp 
65 70 75 80 ' 

ate ttt get cca gaa ttt cag cca ate gat aaa tac get gtc cag gat 288 
He Phe Ala Pro' Glu Phe Gin Pro He Asp Lys Tyr Ala Val Gin Asp 
85 90 * 95. 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta gtg 336 
Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 
100 105 no 

aat get gtg get teg caa atg aag aag cga aag teg ggg cac ate ate 3 84 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His lie lie 
115 120 ~ 125 

ttc ate -act teg get get ccg ttc ggg cca tgg aag gag eta teg act 432 
, Phe He Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
130 135 " 140 

tac act teg get cga get ggg get tgt act eta get aat get .eta teg 480 
Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 150 155 160 

aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccg aat 528 
Lys Glu Leu Gly Glu Tyr Asn lie Pro Val Phe Ala lie Gly Pro Asn 
165 170 ' 175 

tac eta cac teg gag gat teg ccg tac ttc tac ccg act gag ccg tgg 576 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 
180 185 . 190 • 

aag act aat ccg gag cac gtg get " cac gtg aag aag gtg act get eta '624 
Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
195 200 205 

caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt ttg 672 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
210 215 220 

gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca . 720 
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Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 

230 23 5 240 

rf ^ CCC at9 ata 9aa CSt t9g Ccc atg ccc gaa taa 

uly Gly Phe Pro Met lie Glu Arg Trp P ro Gly Met Pro Glu * 

250 



<210> 84 
<211> 254 
"<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HHDH P016097-F9 



<400> 84 

Met Thr Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

5 10 IS 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 

20 25 • 30 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 

33 40 45 

Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu lie Glu 

Ala Val Thr Ser Ala Tyr Gly Gin Val Asp lie Leu Val Ser Asn Asp 

He Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Gin Asp 

Tyr Arg Gly Ala Val Glu Ala Leu Gin lie Arg Pro Phe Ala Leu Val 

100 xo5 11Q 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His lie He 

Phe lie Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 

150 155 - - n 

Lys Glu Leu Gly Glu Tyr Asn lie Pro Val Phe Ala He Gly Pro Asn 

155 170 175 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trr, 

18 0 18 5 

Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 205 

Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 



235 



Gly Gly Phe Pro Met lie Glu Arg Trp Pro Gly Met Pro Gi u 
245 250 

<210> 85 
<211> 765 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH P016097-K10 



240 



90 



WO 2005/018579 



PCT/US2004/004819 



<22l> CDS 

<222> (1) . . . (765) 

<400> 85 

atg acc acc get ate gtc acc aac gtc aaa cat ttt gga ggt atg ggt 4a 

Met Thr Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met G?y 

1 5 io i 5 

age get ctg agg ctg age gaa get ggt cat ace gtc get -tgc cat gat 95 
Ser Ala Leu Arg Leu Ser Glu Ala. Gly His Thr Val Ala Cys His Asp 
20 25 30 

gaa age ttt aaa eag aaa gat gaa ctg gaa get ttt get gaa acc tac i*d 
Glu ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 40 45 

cca cag ctg aaa cca atg age gaa eag gaa cca get gaa ctg ate gaa 192 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu ' 
5 ° 55 60 

get gtc acc age get tac ggt cag gtc gat gtc ctg gtc age aac gat 2*0 
Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 75 ao 

ate ttt get eea gaa ttt cag cca ate gat aaa tac get gtc cag gat 288 
He Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Gin Asp 
85 90 95 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta gtg 336 
Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 
100 105 no 

aat get gtg get teg caa atg aag aag cga aag teg ggg cac ate ate 3 84 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He lie 
115 120 " 125 ' 

.ttc ate act teg get get ccg ttc ggg cca tgg aag gag eta teg act 43? 
Phe He Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
13 ° 135 140 



tac act teg get cga get ggg get tgt tec eta get aat get eta teg 
Tyr Thr Ser Ala Arg Ala Gly Ala Cys Ser Leu Ala Asn Ala Leu Ser 
145 150 155 i 6 o 



tac eta cac teg gag gat teg ccg tac ttc tac ccg act gag ccg tgg 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro- Trp 
180 185 19Q 



480 



aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccg aat 528 
Lys Glu Leu Gly Glu Tyr Asn lie Pro Val Phe Ala He Gly Pro Asn 
165 170 175 



576 



aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 62* 
Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
195 200 205 
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caa cga eta ggg act caa aaa gag ttg ggg qaa tta ata an. ^_ 
Gin Arg Leu Gly Thr Gin Lys Glu LeS ^ ?2 S L LeS 

10 215 220 

IS 52 rf c Ct t9t 9at tat " 9 aCt -99= cag 3tg ttt tgg ttg qca 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu lit 

235 240 • 

ggc ggc ttt ccc atg ata gaa cgt tgg ccc ggc atg ccc gaa taa 
Gly Gly Phe Pro Met lie Glu Arg Trp Pro cly Met Pro Ilu * 
•^ 45 250 



672 



720 



765 



<210> 36 
<211> 254 
<212> PRT 

<213> Artificial Sequence 

/ 

<220> 

<223> HHDH P01S097-H10 
•<400> 85 

Met Thr Thr Ala lie Val Thr Asn Val Lys His Phe Gly Glv Met Glv 

5 10 " 15 

Ser Ala Leu Arg Leu Ser Glu Ala' Gly His Thr Val Ala Cys His Asp 

Glu Ser Phe Lys Gin Lys Asp Glu leu Glu Ala Phe Ala Glu Thr Tyr 

35 40 4 g 

Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu lie Glu 

Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 

He Phe Ala Pro Glu Phe Gin Pro He Asp Zys Tyr Ala Val Gin isp 

Tyr Arg Gly Ala Val Glu Ala Leu Gin U e Arg Pro Phe Ala Leu Val 

100 105 110 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His II- n a 

115 120 ■ 12S 

Phe lie Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

Tyr Thr Ser Ala Arg Ala Gly Ala Cys Ser Leu III Asn Ala Leu Ser 

Lys Glu Leu Gly Glu Tyr Asn lie Pro Val Phe Ala lie Gly Pro lln 

lbS 170 175 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Tro 
180 ^ 
Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 , 05 

Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 

230 235 



Gly Gly Phe Pro Met lie Glu Arg Trp Pro Gly Met Pt- q Glu 
245 250 



240 



<210> 87 
<2il> 765 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH P016099-A1 

<221> CDS 

<222> (1) . . . (765) 

<400> 87 

atg acc acc get ate gtc acc aac gtc aaa cat ttt gga ggt atg ggt 4S 

Met Thr Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 5 10 .15 

age get ctg agg ctg age gaa get ggt cat acc gtc get tgc cat gat 96 
Ser.Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 
20 25 3 0 

gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa acc tac 144 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tvr 
35 40 .45 

cca cag ctg aaa cca atg age gaa cag gaa cea get gaa ctg ate gaa 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 
50 55 • 60 

get gtc acc age get tac ggt cag gtc gat ate ctg gtc age aac gat 
Ala Val Thr Ser Ala Tyr Gly Gin Val Asp He Leu Val Ser Asn Asp 
65 70 75 eo ■ 



ttc ate act teg get get ccg ttc ggg cca tgg aag gag eta teg act 
Phe He Thr Ser Ala Ala Pro Phe Gly Pro Tro Lys Glu Leu Ser Thr 
130 135 140 

tac act teg get cga get ggg get tgt act eta get aat get eta teg 
Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 150 155 160 



19: 



240 



ate ttt get cca gaa ttt cag cca ate gat aaa tac get gtc cag gat 2 83 

He Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Gin Asp 
35 SO * 95 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta gtg ' 336 

Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 
100 105 no 

aat get gtg get teg caa atg aag aag cga aag teg ggg cac ate ate 3 34 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 
115 12 0 " 125 



43: 



480 



aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccg aat 528 
Lys Glu Leu Gly Glu Tyr Asn lie Pro Val Phe Ala He Gly Pro Asn 
165 170 175 

tac eta cac teg gag gat teg ccg tac ttc tac ccg act gag ccg tgg 576 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 
180 185 " 190 
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aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 
Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
.195 200 * 205 

caa cga eta ggg act caa aaa gag ttg g gg gaa ttg gtg gca ttt ttg 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Glv Glu Leu Val A"? a Phe Leu 
210 215 22 0 



gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca 7?0 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 240 

ggc ggc ttt ccc atg ata gaa cgt tgg ccc ggc atg ccc gaa taa 755 
Gly Gly Phe Pro Met He Glu Arg Trp Pro Gly Met Pro Glu * 
245 250 



<210> 83 
<211> 254 
<212> PRT 

<213> Artificial Seqnence 
<220> 

<223> -HHDH P016099-A1 



<400> 83 



Met 


Thr 


Thr 


Ala 


He 


1 








5 


Ser 


Ala 


Leu 


Arg 


Leu 








20 




Glu 


Ser 


Phe 


Lys 


Gin 






•35 






Pro 


Gin 


-Leu 


Lys 


Pro 




■50 








Ala 


Val 


Thr 


Ser 


Ala 


65 










He 


Phe 


Ala 


Pro 


Glu 


Tyr Arg Gly 


Ala 


85 
Val 








100 




Asn 


Ala 


Val 


Ala 


Ser 






115 






Phe 


He 


Thr 


Ser 


Ala 




130 








Tyr 


Thr 


Ser 


Ala 


Arg 


145 










Lys 


Glu 


Leu 


Gly 


Glu 










155 


Tyr 


Leu 


His 


Ser 


Glu 








180 




Lys 


Thr 


Asn 


Pro 


Glu 






195 






Gin Arg Leu 


Gly 


Thr 




210 








Ala 


Ser 


Gly 


Ser 


Cys 


225 











Val Thr Asn Val Lys His 
10 

Ser Glu Ala Gly His Thr 
25 

Lys Asp Glu Leu Glu Ala 
40 

Met Ser Glu Gin Glu Pro 
55 

Tyr Gly Gin Val Asp He 
70 " 75 

Phe Gin Pro He Asp Lys 
90 

Glu Ala Leu Gin He Arg 
105 

Gin Met Lys Lys Arg Lys 
120 

Ala Pro Phe Gly Pro Trp 
135 

Ala Gly Ala Cys Thr Leu 
150 155 
Tyr Asn He Pro Val Phe 
170 

Asp Ser Pro Tyr Phe Tyr 
135 

His Val Ala His Val Lys 
200 

Gin Lys Glu Leu Gly Glu 
215 

Asp Tyr Leu Thr Gly Gin 
230 235 



Phe Gly Gly Met Gly 
15 

Val Ala Cys His Asp 
30 

Phe Ala Glu Thr Tyr 
4 5 

Ala Glu Leu He Glu 
60 

Leu Val Ser Asn Asp 
80 

Tyr Ala Val Gin Asp 

95" 

Pro Phe Ala Leu Val 
110 

Ser Gly His He He 
125 

Lys Glu Leu Ser Thr 
140 

Ala Asn Ala Leu Ser 
160 

Ala lie Gly Pro Asn 
175 

Pro Thr Glu Pro Trp 
190 

Lys Val Thr Ala Leu 
205 

Leu Val Ala Phe Leu 
220 

Val Phe Trp Leu Ala 
240 
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Gly Gly Phe Pro Met He Glu Arg Trp Pro Gly Met Pro Glu 
245 250 

<210> 89 
<211> 765 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH P016231-A-03 

<221> CDS 

<222> (1) . . . (765) 

<400> 89 

atg ago acc get ate gtc acc aac gtc aaa cat ttt gga ggt atg gqt 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 5 10 ~ is 



get gtc acc' age get tac ggt cag gtc gat gtc ctg gtc age aac gat 
Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 75 80 



aat get gtg get teg caa atg aag aag cga aag teg ggg cac ate ate 
Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 
115 120 125 



48 



age get ctg agg ctg age gaa get ggt cat acc gtc get tgc cat gat 96 
Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 
20 25 30 

gaa age ttt aaa cag aaa gat gaa ctg gaa -get ttt get gaa acc tac 144 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 40 45 

cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg ate gaa 192 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 
50 55 so 



240 



ate ttt get tea gaa ttt cag cca ate gat' aaa tac gee gtc gaa gat 238 
He Phe Ala Ser Glu Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp 

85 90 95 - 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta gtg 336 
Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 
100 105 no 



384 



ttc ate act teg get act ccg ttc ggg cca tgg aag gag eta teg act 432 
Phe He Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
130 135 140 

tac act teg get cga get ggg get tgt act' eta get aat get eta teg 480 
Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu -Ser 
1« 150 155 160 

aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccg aat 528 
Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 



95 



WO 2005/018579 



PCT/US2004/004819 



165 170 175 



tat eta cac teg gag gat teg ceg tac tte tac ccg ' act gag ccq tea 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Gil Pro Sp 
130 135 190 

aag act aat ccg gag cac gtg get cac gtg aag aag gtg act act eta 
Lys Thr Asn Pro Glu His Val Ala His Val Lys Lyf ?al- Thr £ Leu 
195 200 205 

caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt ttq 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe £eu 



210 215 



40 



I? 8 = Ct ?? C o Ct t9t 9St tat ttg aCt 93C Ca 9 9 C 3 ttt tgg ttg gca 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ma 

230 235 * ? 

ggc ggc ttt ccc atg ata gaa cgt tgg ccc ggc atg ccc qaa taa 
uly Gly Phe Pro Met lie Glu Arg Trp Pro Gly Met Pro Glu * 
245 .250 



<210> 90 
<211> 254 
<212> PRT 

<213> Artificial Sequence' 
<220> 

<223> HHDH P01S231-A-03 
<400> 90 

Met ser' Thr Ala lie Val Thr Asn Val Lys His Phe Gly Glv Met Gly 

5 10 * 15 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asn 
20 25 30 * 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tvr 

35 -40 45 

Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu lie Glu 

50 55 60 

Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 

70 75 8 q 

lis Phe Ala Ser Glu Phe Gin Pro lie Asp Lys Tyr Ala Val Glu Asp 

85 90 95 

Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 

100 105 110 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His lie lie 

Phe lie Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 

Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala lie Gly Pro Asn 

165 170 1J5 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Tro 

180 135 190 * 

Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 



576 



524 



672 



720 



765 
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195 200 205 

Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

210 215 * 220 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 ~ 235 ~ 240 

Gly Gly Phe Pro Met lie Glu Arg Trp Pro Gly Met Pro Glu 
245 250 

<210> 91 
<211> 765 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH P016231-E-03 

<221> CDS 

<222> (1) . . . (765) 

<400> 91 

atg age acc get ate gtc ace aac gtc aag cat ttt gga ggt atg ggt 4 8 

Met Ser Thr Ala lie Val Thr Asn Val Lys His Phe Gly Gly Met Gly 
1.5 10 15 

age get ctg agg ctg agc.gaa get ggt cat acc gtc get tgc cat gat 96 
Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 
20 2 5 3 0 

gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa acc tac 144 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 40 45 

cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg ate gaa 192 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu lie Glu 
• 50 55 60 

get gtc acc age get tac ggt cag gtc gat gtc ctg gtc age aac gat 24 0 

Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val' Leu Val Ser Asn Asp " 
65 70 75 80 

ate ttt get cca gaa ttt cag cca ate gat aaa tac get gtc gaa gat 288 
lie Phe Ala Pro Glu Phe Gin Pro lie Asp Lys Tyr Ala Val Glu Asp 
85 90 95 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta gtg 336 
Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 
100 . 105 no 

aat get gtg get teg caa atg aag aag cga aag teg ggg cac ate ate 3 84 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He lie 
115 120 ' 125 

ttc ate act teg get act ccg ttc ggg cca tgg aag gag eta teg .act 432 
Phe He Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
130 135 " 140 

tac act teg get cga get ggg get tgt act eta get aat get eta teg 480 
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Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 150 155 l6Q 

aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg C eg aat 523 
Lys Glu Leu Gly Glu Tyr Asn lie Pro Val Phe Ala lie Gly p ro A sn 
165 170 17S 

tac eta cac teg gag gat teg ccg tac tat tat ccg act qag ccg tgg 576 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Tyr Tyr Pro Thr Glu Pro Trp 
180 135 i9o 

aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 62^ 
Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val "Thr Ala Leu 
195 200 * 205 

caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt ttg 67^ 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
210 215 220 



gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca 
Ala Ser Giy Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
■225 230 235 ~ 2 40 



720 



ggc ggc ttt ccc atg ata gaa cgt tgg ccc ggc atg ccc gaa taa 755 
Gly Giy Phe Pro Met lie Glu Arg Trp Pro Gly Met Pro Glu * 
245 250 



<210> 92 
<211> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HKDH P016231-E-03 
. <4Q0> 92 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 ^ 10 15 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 

20 25 30 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala 'Glu Thr Tyr 

35 40 45 

Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 

50 55 60 

Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 75 80 

'He Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp 

85 • 90 * 95 

Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 

100 105 no 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His lie lie 

115 120 * 125 

Phe He Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

130 135 ~ 140 

Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
145 150 155 i6o 
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Lvs G"l u 


Leu 


Glv* Gin Tvr 


Asn lie Pro Val Phe Ala 


lie 


Gly Pro Asn 






ice: 
J. b 3 


170 






175 




Tyr Leu 


His 


Ser Glu Asp 


Ser Pro Tyr Tyr Tyr Pro 


Thr 


Glu 


Pro 


Trp 






180 


185 




190 




Lys Thr 


Asn 


Pro Glu His 


Val Ala His Val Lys Lys 


Val 


Thr 


Ala 


Leu 




195 




200. 


205 








Gin Arg 


Leu 


Gly Thr Gin 


Lys Glu Leu Gly Glu Leu Val 


Ala 


Phe 


Leu 


210 






215 220 










Ala Ser 


Gly 


Ser Cys Asp 


Tyr Leu Thr Gly Gin Val 


Phe 


Trp 


Leu 


Ala 


225 




230 


235 








240 


Gly Gly 


Phe 


Pro Met lie 


Glu Arg Trp Pro Gly Met 


Pro 


Glu 










245 


250 











<210> 93 
<211> 765 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH S00827801 

<221> CDS 

<222> (1) . . . (765) 

<400> 93 

atg age acc get ate gtc acc aac gtc aaa cat ttt gga ggt atg ggt 43 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 5 10 is 

age get ctg agg ctg age gaa ggt ggt cat acc gtc get tgc cat gat 96 
Ser Ala Leu Arg Leu Ser Glu Gly Gly His Thr Val Ala Cys His Asp 
20 25 30 

gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa acc tac 144 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 40 45 " 

cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg ate gaa 192 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 
50 55 60 

get gtc acc age get ttt ggt cag gtc gat gtc ctg gtc age aac gat 240 
Ala Val Thr Ser Ala Phe Gly Gin Val Asp Val Leu Val Ser Asn Asp 
b5 70 75 80 

ate ttt get cca gaa ttt cag cca ate gat aaa tac get gtc gaa gat 288 
He Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala. Val Glu Asp 
85 90 ^ 95 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta gtg 336 
Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 
100 105 110 

aat get gtg get teg caa atg aag aag cga aag teg ggg cac ate ate 3 84 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly Kis lie lie 
115 120 125 
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Ph* Ti tC tCg 9 " 9CC CCg ttC 9gg cca t99 gag eta tea act d 3 * 

Phe lie Thr Ser Ala Ala Pro Phe Gly Pro Trp Ly | | X J Leu j£ 

135 140 

S £ J2 S S S S? E £ S S S £ S £ £ «• 

15 °. • iss 160 

aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccq aat 
Lys Glu Leu Gly Glu Tyr Asn lie Pro Val Phe Ala He lly Pro Asn 
it»5 170 17g 

tac eta cac teg gag gat teg ccg tac ttc tac ccg act gag ccq tea 
Tyr Leu Hxs Ser Glu Asp Sar Pro .Tyr Phe Tyr Pro Thr Glu Pro S 
180 • 135 190 

aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 
Lys Thr Asn Pro Glu His Val Ala His Val Lys Lyf Vat Thr 111 Leu 
195 200 205 

caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt tea 
•Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu ?al S Phe ll ' 
" 10 215 220- 

lit e" t9t 93t " 9 aCt " C Ca 9 St9 ttt tgg ttg gca 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ma 

240 

ggc ggc ttt ccc atg ata gaa cgt tgg ccc ggc atq ccc aaa taa 
Gly Gly Phe Pro Met lie Glu Arg Trp Pro lly Me? Pro Tlu " 
245 250 

<210> 94- 
<211> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HHDH S00827801 * 



<400> 94 








Met 


Ser 


Thr 


Ala 


lie 


Val 


1 








5 




Ser 


Ala 


Leu 


Arg 


Leu 


Ser 








20 






Glu 


Ser 


Phe 


Lys 


Gin 


Lys 






35 








Pro 


Gin 


Leu 


Lys 


Pro 


Met 




50 








Ala 


Val 


Thr 


Ser 


Ala 


Phe 


65 










70 


lie 


Phe 


Ala 


Pro 


Glu 


Phe 










85 




Tyr Arg 


Gly 


Ala 


Val 


Glu 








100 






Asn 


Ala 


Val 


Ala 


Ser 


Gin 



blu Gly Gly His Thr Val Ala Cys His Asi 

25 30 
Asp Glu Leu Glu Ala Phe Ala Glu Thr Ty^ 

4 ° 45' 
Ser Glu Gin Glu Pro Ala Glu Leu He Gl\ 
55 60 

Gly Gin Val Asp Val Leu Val Ser Asn Asr 
75 ' 80 
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115 120 125 

Phe He Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

130 135 140 

Tyr Thr Ser Ala Arg Ala Gly Ala Ser Thr Leu Ala Asn Ala Leu Se^ 
145 150 155 i 6 o 

Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 

155 170 175 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 

180 185 - igo 

Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 205 

Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

210 215 220 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 .235 " 240 

Gly Gly Phe Pro Met He Glu Arg Trp Pro Gly Met Pro Glu 
245 250 

<210> 95 
<211> 765 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH S00890554 

<221> CDS 

<222> (1) . . . (755) 

<400> 95 

atg age acc get ate gtc acc aac gtc aaa cat ttt gga ggt atg ggt 46 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 5 10 15 

age get ctg agg ctg age gaa get ggt cat acc gtc get tgc cat gat 96 
Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 
20 25 3 0 

gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa acc tac 144 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 40 45 

cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg ate gaa 192 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 
50 55 60 

get gtc acc age get ttt ggt cag gtc gat gtc ctg gtc age aac gat 240 
Ala Val Thr Ser Ala Phe Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 75 80 

ate ttt get eta gaa ttt cag cca ate gat aaa tac get gtc gaa gat 283' 
He Phe Ala Leu Glu Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp 
85 90 ' 95 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta gtg 3 36 

Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 
100 105 HO 
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aat get gtg get teg caa atg aag aag cga aag teg ggg cac ate ate 
Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His lis lie 
• US 120 

ttc ate act teg get gee ccg ttc ggg cca tgg aag gag eta tea act 
Phe lie Thr Ser Ala Ala Pro Phe Gly P ro T g hy l ^ Leu ^ 
130 13s ... 



384 



tac act teg get cga get ggg get agt act eta get aat get eta tea 
Tyr Thr Ser Ala Arg Ala Gly Ala Ser Thr Leu Ala Asn Ila Leu Ser 
5 150 1S5 leo 

aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggq ccq aat 
Lys Glu Leu Gly Glu Tyr Asn lie Pro Val Phe Ala He Gly Pro Asn 
155 170 175 

tac eta cac teg gag gat teg ccg tac tat tac ccc act gag ccg tgq 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Tyr Tyr Pro Thr Glu Pro Trp 
180 las igo 

aag att aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 
Lys lie Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
195 200 205 

caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt ttg ' 
<-ln Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Ph= Leu 
210 215 220 • 

gca tct ggc tet tgt gat tat ttg act ggc cag gtg ttt tgg ttq qca 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
' 2S 230 235 240 

ggc ggc ttt ccc acg ata gaa cgt tgg ccc ggc atg ccc gaa taa 
Gly Gly Phe Pro Thr lie Glu Arg Trp Pro Gly Met Pro Glu * 
245 . 250 



<210> 96 
<211> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223=> HHDH S00890554 
<400> 96 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 5 io 1S 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cvs His Asd 

20 25 3* 0 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Th- Tv- 
35 40 AS " *~ 

Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 

50 5 5 60 

Ala Val Thr Ser Ala Phe Gly Gin Val Asp Val Leu Val Ser Asn Asd 
65 70 75 so - 



430 



528 



576 



624 



672 



720 



765- 
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lie 


Phe 


Ala 


Leu 


Glu 


Phe 


Gin 


Pro He Asp Lys Tyr Ala Val 


Glu 


Asp 










85 






90 


95 


Tyr 


Arg 


Gly 


Ala 
100 


Val 


Glu 


Ala 


Leu Gin lie Arg Pro Phe Ala 
105 no 


Leu 


Val 


Asn 


Ala 


Val 
115 


Ala 


Ser 


Gin 


Met 


Lys Lys Arg Lys Ser Gly His 
120 125 


He 


He 


Phe 


He 
130 


Thr 


Ser 


Ala 


Ala 


Pro 
135 


Phe Gly Pro Trp Lys Glu Leu 
140 


Ser 


Thr 


Tyr 


Thr 


Ser 


Ala 


Arg 


Ala 


Gly 


Ala Ser Thr Leu Ala Asn Ala 


Leu 


Ser 


145 










150 




155 




160 


Lys 


Glu 


Leu 


Gly Glu 


Tyr 


Asn 


He Pro Val Phe Ala He Gly Pro. Asn 










165 






170 


175 




Tyr 


Leu 


His 


Ser 


Glu 


Asp 


Ser 


Pro Tyr Tyr Tyr Pro Thr Glu 


Pro 


Trp 








180 








185 190 




Lys 


He 


Asn 
195 


Pro 


Glu 


His 


Val 


Ala His Val Lys Lys Val Thr 
200 205 


Ala 


Leu 


Gin 


Arg 


Leu 


Gly Thr 


Gin 


Lys 


Glu Leu Gly Glu Leu Val Ala 


Phe 


Leu 




210 










215 


220 






Ala 


Ser 


Gly 


Ser 


Cys 


Asp 


Tyr 


Leu Thr. Gly Gin Val Phe Trp 


Leu 


Ala 


225 










230 




23 5 




240 


Gly 


Gly 


Phe 


Pro 


Thr 
245 


He 


Glu 


Arg Trp Pro Gly Met Pro Glu 
250 







<210> 97 
<211> 765 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH S00994580 

<221> CDS 

<222> (1) . . . (765) 

<400> 97 

atg age acc get att gtc acc aac gtc aaa cat ttt gga ggt atg ggt " 43- 

Met Ser Thr Ala He Val ' Thr Asn Val Lys His Phe Gly Gly Met Gly 
15 10 ' " 15 

age get ctg agg ctg age gaa get ggt cat acc gtc get tgc cat gat 96 
Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala" Cys His Asp 
20 25 30 

gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa acc tac 144 
Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 40 45 

cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg att gaa 192 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 
50 55 "60 

get gtc acc age get ttt ggt cag gtc gat gtc ctg gtc age aac gat 240 
Ala Val Thr Ser Ala Phe Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 75 80 

ate ttt get eta gaa ttt cag cca ate gat aaa tac get gtc gaa gat 288 
He Phe' Ala Leu Glu Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp . 
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85 go 



95 



tac agg ggt get gtc gaa get ctg cag ate agg cea ttt get eta eta 
Tyr Arg Gly Ala Val Glu Ala Leu Gin U e Arg Pro Phe J la Leu JJf 
100 105 110 

aat get gtg get teg caa atg aag aag cga aag teg, ggg cac ate ate 
Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His lie lie 
115 120 125 

ttc ate act teg get gee ccg ttc ggg cca tgg aag gag eta teg act 
Phe lie Thr Ser Ala Ala Pro Phe Gly Pro Tr P Lys Glu Leu Ser 
130 135 140 



tac act teg get cga get ggg get agt act eta get aat get eta teg 
Tyr Thr Ser Ala Arg Ala Gly Ala Ser Thr Leu Ala Asn Ala Leu Ser 
145 150 155 iso 



336 



384 



432 



480 



aag gag eta gga gag tac aat ate ccg gtg ttc get .ate ggg ccg aat 523 
Lys Glu Leu Gly Glu Tyr Asn lie Pro Val Phe Ala lie Gly Pro Asn 
165 170 175 

tac eta cac teg gag gat teg ccg tac tat tac ccc act gag ccq tgg 57b - 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Tyr Tyr Pro Thr Glu Pro- Trp 
180 las lgQ 

aag att aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 524 
Lys lie Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
195 200 205 

caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt ttg 672 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
210 215 220 

gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca 720 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ila 
225 230 235 " 24Q 

ggc ggc ttt ccc acg ata gaa cgt tgg ccc ggc atg ecc gaa taa 755 
toly Gly Phe Pro Thr lie Glu Arg Trp Pro' Gly Met Pro Glu * 
245 250 

<210> 98 
<21I> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223=> HHDH S00994580 
<400> 9a 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 5 10 ' 15 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asd 

20 25 30 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tvr 
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35 40 45 

Pro Gin -Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 

50 55 60 

Ala Val Thr Ser Ala Phe Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 75 80~ 

He Phe Ala Leu Glu Phe Gin Pro lie Asp Lys Tyr Ala Val Glu Asp 

85 90* ^ . 95 

Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 

100 105 no 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly .His He He 

115 120 ~ 125 

Phe lie Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

130 • 135 * 140 

Tyr Thr Ser Ala Arg Ala Gly Ala Ser Thr Leu Ala Asn Ala Leu Ser 
145 150 155 160 

Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala lie Gly Pro Asn 

165 170 175 

Tyr Leu His. Ser Glu Asp Ser Pro Tyr Tyr Tyr Pro Thr. Glu Pro Trp 

180 185 190 

Lys lie Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 " ' 205 

Gin Arg Leu Gly Thr- Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

210 215 220 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 ~ 240 ' 

Gly Gly Phe Pro Thr He Glu Arg Trp Pro Gly Met Pro Glu 
245 " 250 

'<210> 99 
<211> 765 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH S01018044 

<221> CDS 

<222> (1) . . , (765) 

<400> 99 

atg age acc get ate gtc acc aac gtc aaa cat ttt gga ggt atg ggt 4 8 

Met Ser Thr Ala lie Val Thr Asn Val Lys His Phe Gly Gly Met Gly 
1 5 10 is 

age get ctg agg ctg age gaa get ggt cat acc gtc get tgc cat gat 96 
•Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 
20 25 30 

gaa age ttt aag cat aaa gat gaa ctg gaa get ttt get gaa acc tac 144 
Glu Ser Phe Lys His Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
"35 40 45 

cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg ate gaa 192 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 
50 55 60 

get gtc acc age get ttt ggt cag gtc gat gtc ctg gtc age aac gat . 240 
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Ala Val Thr Ser Ala Phe Gly Gin Val Asp Val Leu Val Ser Asn Asp 
5 . 70 " 80 

t^ C n^"!?* gaa tfct C " C " atC 9at aaa tac Set gtc gag gat n B8 

He Phe Ala Leu Glu Phe Arg Pro He Asp Lys Tyr Ala Val Slu Isl 

85 90 . .95 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta eta 7->s 
Tyr Arg Gly Ala Val Glu Ala Leu Gin U e Ar| Pro Phe III Leu Si 
100 105 11Q 

aat get gtg get teg caa atg aag aag cga aag teg gqq cac ate ate ™<i 
Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lyf Se? fly nit lie lie 
115 12 o 125 

" C ^ C tc 3 9ct gee ccg ttc ggg cca tgg' aag gag eta teg act <i 3 " 

Phe He Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Slu Leu Ser ?hr 
130 135 140 

tac tct teg get cga get ggg get agt gca eta get aat get eta tea 4 S n 
Tyr Ser Ser Ala Arg Ala Gly Ala Ser Ala Leu Ila Asn Ila Leu Ser 

iso 155 1S0 

aag gag eta gca gag tac aat ate ccg gtg ttc get ate ggg ccq aat 
Lys Glu Leu Gly Glu Tyr Asn lie Pro Val Phe Ala lie G?y Pro III 
15 5 170 175 

tac eta cac teg gag gat teg ccg tac tat tac ccc act gag ccg tec 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Tyr Tyr Pro Thr Glu Pro Tr P 
130 las igo 

aag att aat ccg gag cac gtg get cac gtg aag aag gtg act act eta 
Lys lie Asn Pro Glu His Val Ala His Val Lys Lys Val Thr 111 Leu 
195 200 205 

caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gca ttt ttg 672 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ila Phe Leu 
210 215 220 

gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu ila 

- JU 240 

ggc ggc ttt ccc gtc ata gaa cgt tgg ccc ggc atg ccc gaa taa 765 
Gly Gly Phe Pro Val lie Glu Arg Trp Pro Gly Met Pro Glu * 
245 250 



<210> 100 
<211> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HKDH S0108044 
<400> 100 



528 



576 
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Met Ser Thr Ala lie Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 5 10 'is 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cvs His Asp 

20 25 30 

Glu Ser Phe Lys His Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 

3 5 40 45 

Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 

50 55 60 

Ala Val Thr Ser Ala Phe Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 75 80 

lie Phe Ala Leu Glu Phe Arg Pro He Asp Lys Tyr Ala Val Glu Asp 

85 90 95 

Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 

100 105 no 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 

115 120 " * 125 

Phe He Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

130 135 ■ .140 

Tyr Ser Ser Ala Arg Ala Gly Ala Ser Ala Leu Ala Asn Ala Leu Ser ' 
145 150 155 160 

Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 

165 170 175 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Tyr Tyr Pro Thr Glu Pro Trp 

180 185 " 190 

Lys He Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 ~ 205 

Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

210 215 220 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 ' 235 240 

Gly Gly Phe Pro Val He Glu Arg Trp Pro Gly Met Pro Glu 
245 250 

<210> 101 
<211> 765 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH S01035939 

<221> CDS 

<222> (1) . . . (765) 

<400> 101 

atg age acc get att gtc acc aac gtc aaa cat ttt gga ggt atg ggt 48 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 5 10 15 

age get ctg agg ctg age gaa get ggt cat acc gtc get tgc cat gat 96 
Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 
20 25 30 

gaa age ttt aag cat aaa gat gaa ctg gaa get ttt get gaa acc tac 144 
Glu Ser Phe Lys His Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 40 45 
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cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg att gaa 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu lie Glu 
50 55 60 

get gtc ace age get ttt ggt cag gtc gat gtc ctg gtc age aac cat 
Ala Val Thr Ser Ala Phe Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 75 so 

ate ttt get eta gaa ttt egg cca ate gat aaa tac get gtc gag gat 
He Phe Ala Leu Glu Phe Arg Pro He Asp Lys Tyr Ala Val Glu Asp 



85 so 



95 



tac tct teg get cga get ggg get agt gca eta get aat get eta tea 
Tyr Ser Ser Ala Arg Ala Gly Ala Ser Ala Leu Ala Asn Ala Leu Ser 
• 14S 150 • 155 160 



caa cga eta ggg act eaa aaa gag ttg ggg qaa ttg gtg gca ttt ttg 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
210 215 220 



191 



240 



283 



tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta gtg 33b - 
Tyr Arg Gly Ala Val Glu Ala Leu Gin lie Arq Pro Phe Ala Leu Val 
100 105 no 

aat get gtg get teg caa atg aag aag cga aag teg ggg cac ate ate 3 8* 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 
115 120 ' 12 s 

ttc ate act .teg get gec ccg ttc ggg cca tgg aag gag eta teg act 4^2 
•Phe lie Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
130 135 140 . 



480 



aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccg aat '593 
Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 
165 170 175 

tac eta cac teg gag gat teg ccg tac tat tae ccc act gag ccg tgg 576 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Tyr Tyr Pro Thr Glu Pro Trp 
180 135 190 

aag att aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 6~>* 
Lys lie Asn Fro Glu His Val Ala His Val Lys Lys Val Th>- Ala Leu 
195 200 205 



672 



gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca 720 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp -Leu Ala 
225 230 235 24Q 

ggc ggc ttt ccc gtc ata gaa cgt tgg ccc ggc atg ccc gaa taa 765 
Gly Gly Phe Pro Val He Glu Arg Trp Pro Gly Met Pro Glu * 
245 250 



<210> 102 

<211> 254 

<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> HHDH S01035939 



<400> 102 



Met 


Ser 


Thr Ala 


He 


Val Thr Asn Val Lys His 


Phe. 


Gly Gly Met 


Gly 


1 






5 


10 








15 




Ser 


Ala 


Leu Arg 


Leu 


Ser Glu Ala Gly His Thr 


Val 


Ala 'Cys 


His 


Asp 






20 




25 






30 




Glu 


Ser 


Phe Lys 


His 


Lys Asp Glu Leu Glu Ala 


Phe Ala .Glu Thr Tyr 






3 5 




40 




45 








Pro 


Gin 


Leu Lys 


Pro 


Met Ser Glu Gin Glu Pro 


Ala 


Glu 


Leu 


He 


Glu 




50 






55 


60 










Ala 


Val 


Thr Ser 


Ala 


Phe Gly Gin Val Asp Val 


Leu 


Val 


Ser 


Asn Asp 


65 








70 75 










80 


He 


Phe 


Ala Leu 


Glu 


Phe Arg Pro lie Asp Lys 


Tyr 


Ala 


Val 


Glu 


Asp 








85 


.90 








95 




Tyr 


Arg 


Gly Ala 


Val 


Glu Ala Leu Gin He Arg 


Pro 


Phe 


Ala 


Leu 


Val 






100 




105 






110 






Asn 


Ala 


Val Ala 


Ser 


Gin Met Lys Lys Arg Lys 


Ser Gly His 


He 


lie 






115 




120 




125 








Phe 


He 


Thr Ser 


Ala 


Ala Pro Phe Gly Pro Trp 


Lys 


Glu 


Leu 


Ser 


Thr 




130 






135 


140 










Tyr 


Ser 


Ser Ala 


Arg 


Ala Gly Ala Ser Ala Leu 


Ala 


Asn 


Ala 


Leu 


Ser 


145 








150 155 










160 


Lys 


Glu 


Leu Gly 


Glu 


Tyr Asn He Pro Val Phe 


Ala 


He 


Gly Pro Asn 








165 


170 








175 




Tyr 


Leu 


His Ser 


Glu 


Asp Ser Pro Tyr Tyr Tyr 


Pro 


Thr 


Glu 


Pro Trp 






180 




185 






190 






Lys 


lie 


Asn Pro 


Glu 


His Val Ala His Val Lys 


Lys 


Val 


Thr 


Ala 


Leu 






195 




200 




205 








Gin 


Arg 


Leu Gly 


Thr 


Gin Lys Glu Leu Gly Glu 


Leu 


val 


Ala 


Phe 


Leu 




210 






215 


220 










Ala 


Ser 


Gly Ser 


Cys 


Asp Tyr Leu Thr Gly Gin 


Val 


Phe 


Trp 


Leu 


Ala 


225 








230 235 










240 


Gly 


Gly 


Phe Pro 


Val 


He Glu Arg Trp Pro Gly 


Met 


Pro 


Glu 












245 


250 













<210> 103 
<211> 768 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH S01009684 

<221> CDS 

<222> (1) . . . (765) 

<400> 103 

atg age act gcg att gtt act ; .aac gtc aaa cat ttc ggt ggt atg ggg 4 8' 

Met Ser Thr Ala He Val Thr' Asn Val Lys His Phe Gly Gly Met Gly 

1 .5 10 15 

agt gcg ctg aaa ctg tc'g gag gec ggt cat aca gta gcg tgt cac gat 96 
Ser Ala Leu Lys Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 
20 25 30 
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gaa a^ ttc aaa caa aag gac gaa ttg gaa gcc ttt gcg gaa act tac km 
Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu T hr T ^ 
35 40 45 7 " 

cca cag ctg aaa ccg atg tea gaa cag gag cca gcg gag tta att qaq ,„ 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ila Glu Leu lie Glu 



100 105 



110 



aat gcc gta get agt caa atg aaa aaa cgt aaa tea got cat att ate 
Asn Ala val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He lie 
115 120 



125 



ttc att act age gcg get ccg ttt ggt cca tgg aag gag ctg tea act 
Phe lie Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys 111 Leu Ser ?£ 
2-30 135 



140 



190 



aag ate aac cct gaa cat gtg gcg cat gtt aaa aaa gta aca gcc tta 
Lys He Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
195 200 



205 



caa cgt ctg ggt act caa aaa gaa ctt ggc gag ctg gtt gcg ttt etc 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
210 215 220 

get tct ggt age tgc gat tac ttg ace ggc caa gtc ttc tgg tta gcc 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 

230 235 240 

ggt ggc ttt ccg gtg att gag cgc tgg cca ggt atg ccg gaa taa 
Gly Gly Phe Pro Val Ila Glu Arg Trp Pro Gly Met Pro Glu * 
■ 245 250 



240 



50 • ss 

gcg gtc acc agt gcc ttc ggc cag gtg gat gtc ctg gtt aqc aac aac 
Ala Val Thr Ser Ala Phe Gly Gin Val Asp Val Leu Val Ser Asn IZ 
65 70 75 ao 

ti° ^ S f CtS 9aa ttC Cag cc9 aCt 9 ac aaa tat 9cc gtc gaa qac ' 288 
He Phe Ala Leu Glu Phe Gin Pro lie Asp Lys Tvr Ila Val Glu Asp 
85 . 90 " . 95 V 

tac cgt ggt gcg gta gaa gca c'tg cag ate aag ccg ttc gcg ttg gtt 33s 
Tyr Arg Gly Ala Val Glu Ala Leu Gin lie Lys Pro Phe 111 Leu ?al 



384 



432 



tat agt agt gcg cgc gcc ggg gcc tec gcg ttg get aac gca ctg agt 

Tyr Ser Ser Ala .Arg Ala Gly Ala Ser Ala Leu Ala Asn Ila Leu sL 

145 150 155 i 6 o 

aaa gaa tta ggt gag tat aat ate cct gtt ttc gcc att ggg cca aae ' 

Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala lie Gly Pro Asn 
I 65 170 175 

tat ctg cac tea gaa gat age cca tac tat tac cca acg gaa ccg tgg 576 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Tyr Tyr Pro Thr Glu Pro Trp 
180 185 



480 



528 



624 



572 



720 



765 



110 
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768 



tga 

<210> 104 
<211> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HHDH S01009684 
<400> 104 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 5 10 is 

Ser Ala Leu Lys Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 

20 25 30 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 

35 40 ■ 45 

Pro Gin Leu Lys Pro Met Ser- Glu Gin Glu Pro Ala Glu Leu He Glu 

50 55 go 

Ala Val Thr Ser Ala Phe Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 75 80 

He Phe Ala Leu Glu Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp 

85 90 " ' 95 

Tyr Arg Gly Ala Val Glu Ala Leu Gin He Lys Pro Phe Ala Leu Val 

100 105 no 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 

115 120 " 125 

Phe He Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr' 

130 135 140 

Tyr Ser Ser Ala Arg Ala Gly Ala Ser Ala Leu Ala Asn Ala Leu Ser 
145 150 155 iso 

Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 

165 170 175 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Tyr Tyr Pro Thr Glu Pro Trp 

180 185 190 

Lys He Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 J 205 

Gin Arg Leu Gly' Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

210 215 220 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 " 240 

Gly Gly Phe Pro Val He Glu Arg Trp Pro Gly Met Pro Glu 
245 250 

<210> 105 
<211> '774 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH S00817219 

<221> CDS 

<222> (1) . . . (765) 

<400> 105 

atg age acc get ate gtc acc aac gtc aaa cat ttt gga ggt atg ggt 48 
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Met Ser Thr Ala lie Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

Ser ?f r t9 *" f" 9 390 S f S 9Ct 99t Cat acc 9 tc 9 ct cat gat 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 

20 25 30 

gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get- gaa acc tac 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu ?hr Tvr 
35 4 0 



45 



• cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg ate gaa 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu lie Glu 



50 55 



60 



110 



aat get gtg get teg caa atg aag aag cga aag teg ggg eac ate ate 
Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lvs Ser Gly His lie III 
115 120 



125 



130 135 



tac act teg get cga get ggg get tgt tec eta get aat act eta tea 
Tyr Thr Ser Ala Arg Ala Gly Ala Cys Ser Leu Ala Asn Ala Leu Ser 



145 150 



155 ■ 160 



aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccg aat 
Lys Glu Leu Gly Glu Tyr Asn lie Pro Val Phe Ala lie Gly Pro Asn 
165 170 



175 



tac eta cac teg gag gat teg ccg tac ttc tac ccg act gag ccg tgg 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Tro 
180 135 "- 



190 



aag act aat ccg gag cac gtg get cac gtg aag aag gtg act act eta 
Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
195 200 



205 



gca tct ggc tot tgt gat tat ttg act ggc cag gtg ttt tag ttg aca 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 



95 



144 



192 



240 



f? ffi lu C « 9C 3Ct ta ° 99t Cag gtc 9 at 9 tc ct 9 3tc age aac gat 
Ala Val Thr Ser Ala Tyr Gly Gin Val Asp' Val Leu Val Ser Asn £p 

b5 70 75 go 

ate ttt get cca gaa ttt cag cca ate gat aaa tac get gtc aaa gat 288 
He. Phe Ala Pro Glu Phe Gin Pro lie Asp Lys Tvr Ala Val !lu lip 
85 9 ° 95 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt act eta gtg 335 
Tyr Arg Gly Ala Val Glu Ala Leu Gin lie Arg Pro Phe Ila Leu Val 
100 105 



384 



Phf ^ o CS ?f 9 f CCg ttC 999 CCa tg 9 aa 9 9*9 ct * teg act 43? 

Phe lie- Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

140 



480 



528 



576 



624 



caa cga eta ggg act caa aaa gag ttg ggg gaa ttg gtg gea ttt ttg 67? 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
210 215 220 



720 
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225 230 235 240 

ggc ggc ttt ccc atg ata gaa cgt tgg ccc ggc atg ccc gaa taa 765 
Gly Gly Phe Pro Met lie Glu Arg Trp Pro Gly Met Pro Glu * 
245 250 

tgaggatcc ' 774 

<210> 106 
<211> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HHDH S00817219 



<400> 106 



Met Ser Thr 


Ala 


He 


Val 


Thr Asn Val Lys His 


Phe Gly Gly Met Gly 


1 




5 




10 








15 




Ser Ala Leu 


Arg Leu 


Ser 


Glu Ala Gly His Thr 


Val 


Ala 


Cys 


His Asp 




20 






25 






30 






Glu Ser Phe 


Lys 


Gin 


Lys 


Asp Glu Leu Glu Ala 


Phe 


Ala 


Glu 


Thr 


Tyr 


35 








40 




45 








Pro Gin Leu 


Lys 


Pro 


Met 


Ser Glu Gin Glu Pro 


Ala 


Glu 


Leu 


lie 


Glu 


50 








55 


60 










Ala Val Thr 


Ser 


Ala 


Tyr 


Gly Gin Val Asp Val 


Leu 


Val 


Ser 


Asn Aso 


65' 






70 


75 










80 


He Phe Ala 


Pro 


Glu 


Phe 


Gin Pro He Asp Lys 


Tyr Ala Val 


Glu 


Asp 






85 




90* 








95 


Tyr Arg Gly 


Ala 
100 


Val 


Glu 


Ala Leu Gin He Arg 
105 


Pro 


Phe 


Ala 
110 


Leu 


Val 


Asn Ala Val 


Ala 


Ser 


Gin 


Met Lys Lys Arg Lys 


Ser Gly His 


He 


He 


' 115 








120 




125 








Phe He Thr 


Ser 


Ala 


Ala 


Pro Phe Gly Pro Trp 


Lys 


Glu 


Leu 


Ser 


Thr 


.130 








135 


140 










Tyr Thr Ser 


Ala 


Arg 


Ala 


Gly Ala 'Cys Ser Leu 


Ala 


Asn 


Ala 


Leu 


Ser 


145 






150 


155 










160 


Lys Glu Leu 


Gly Glu 


Tyr 


Asn He Pro Val Phe 


Ala 


He 


Gly 


Pro 


Asn 






165 




170 






175 




Tyr Leu His 


Ser 


Glu 


Asp 


Ser Pro Tyr Phe Tyr 


Pro 


Thr 


Glu 


Pro 


Trp 




180 






185 






190 




Lys Thr Asn 


Pro 


Glu 


His 


Val Ala His Val Lys 


Lys 


Val 


Thr 


Ala 


Leu 


195 








200 




205 








Gin Arg Leu 


Gly Thr 


Gin 


Lys Glu Leu Gly Glu 


Leu 


Val 


Ala 


Phe 


Leu 


210 








215 


220 










A.la Ser Gly 


Ser 


Cys 


Asp 


Tyr Leu Thr Gly Gin 


Val 


Phe 


Trp 


Leu 


Ala 


225 






230 


235 










240 


Gly Gly Phe 


Pro 


Met 
245 


He 


Glu Arg Trp Pro Gly 
250 


Met 


Pro 


Glu 







<210> 107 
<211> 774 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH S00708827 
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<221> CDS 

<222> (1) . . . (765) 

<400> 107 

atg age acc get ate gtc ace aac gtc aaa cat ttt gga ggt atg ggt 

Met Ser Thr Ala lie Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 5 10 15 

age get ctg agg ctg age gaa get ggt cat acc gtc get tgc cat gat 
Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 
20 25 30 



43 



96 



gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa acc tac 144 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 

35 40 45 

cca cag ctg aaa cca atg age gaa cag gaa cca get gac ctg att gaa 192 

Pro' Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Asp Leu lie Glu 

50 55 60 

•get gtc acc age get tac ggt cag gte gat gtc ctg gtc age aac gat 240 

Ala Val Thr Ser Ala Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 

65 70 75 80 

ate ttt get cca gaa ttt cag cca ate gat aaa tac get gtc gaa gat 288 

lie Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp 

85 90 95 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get eta gtg 336 

Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 

100 105 no 

aat get gtg get teg caa atg aag aaa cga aag teg ggg cac ate ate 3 84 

Asn' Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 

115 120 " ' 125 

tte ate act teg get act ccg ttc ggg cca tgg aaa gag eta teg act 432 

Phe He Thr Ser Ala Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

130 135 ' 140 

tac act teg get ega get ggg get tgt act eta get- aat get eta teg 480 

Tyr Thr Ser Ala Arg Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 

145 150 155 160 

aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccg aat 528 

Lys Glu Leu Gly Glu Tyr Asn lie Pro Val Phe Ala He Gly Pro Asn 

165 170 175 

tac eta cac teg gag gat teg ccg tac ttc tac ccg act gag ccg tgg 576 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 

180 185 190 

aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 624 

Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 205 
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caa cga eta ggg act caa aaa gag ttg 
Gin Arg Leu Gly Thr Gin Lys Glu Leu 
210 215 

gca tct ggc tct tgt gat tat ttg act 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr 
225 230 

ggc ggc ttt ccc ata ata gaa cgt tgg 
Gly Qly Phe Pro He He Glu Arg Trp 
245 

tgaggatcc 

<2io> ioa 

<211> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HHDH S00708827 



ggg gaa ttg gtg gca ttc ctg 1572 

Gly Glu Leu Val Ala Phe Leu 
220 

ggc cag gtg ttt tgg ttg gca 720 

Gly Gin Val Phe Trp Leu Ala 

235 240 

ccc ggc atg ccc gaa taa 765 

Pro Gly Met Pro Glu * 

250 



774 



<400> 108 
Met Ser Thr 
1 

Ser Ala Leu 

Glu Ser Phe 
35 

Pro Gin Leu 
50 

Ala Val Thr 
65 

He Phe Ala 



Ala He 
5 

Arg Leu 
20 

Lys Gin 
Lys Pro 
Ser Ala 



Pro Glu 
85 

Tyr 'Arg Gly Ala Val 
100 
..Ala Ser 



Asn Ala Val. 

115 

Phe He Thr 
130 

Tyr Thr Ser 
145 

Lys Glu Leu 

Tyr Leu His 

Lys Thr Asn 
195 

Gin Arg Leu 

210 
Ala Ser Gly 
225 

Gly Gly Phe 



Ser Ala 

Ala Arg 

Gly Glu 
165 
Ser Glu 
180 

Pro Glu 

Gly Thr 

Ser Cys 

Pro He 
245 



Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

10 15 
Ser Glu Ala Gly His Thr Val Ala Cvs His Asp 

25 30 
Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 

40 45 
Met Ser Glu Gin Glu Pro Ala Asd Leu He Glu 

55 60 
Tyr Gly Gin Val Asp Val Leu Val Ser Asn Asp 
70 75 ao 

Phe Gin Pro lie Asp Lys Tyr Ala Val Glu Asp 

90 95 
Glu 'Ala Leu Gin He Arg Pro Phe Aia Leu Val 

105' no' 
Gin Met Lys Lys Arg Lys. Ser Gly His He He 

120 125 
Thr Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

135 140 
Ala Gly Ala Cys Thr Leu Ala Asn Ala Leu Ser 
150 155 160 

Tyr Asn lie Pro Val Phe Ala lie Gly Pro Asn 

170 ~ 175 

Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Trp 

185 190 
His Val Ala His Val Lys Lys Val Thr Ala Leu 

200 205 
Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 

215 220 
Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
230 235 ~ .240 

lie Glu Arg Trp Pro Gly Met Pro Glu 
250 



<210> 109 
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<211> 765 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> HHDH S00772501 

<221>- CDS 
"<222> (1) . . . (765) 

<400> 109 

atg age acc get ate gtc acc aac gtc aaa cat ttt gga ggt atg ggt 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Glv Giy Met Gly 

15 10 " 15 



cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg ate gaa 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 
50 55 60 



tac eta cae teg gag gat teg ccg tae ttc tac ccg act gag ccg tgg 
Tyr Leu His Ser Glu Asp Ser Pro Tyr Phe Tyr Pro Thr Glu Pro Tro 



48 



age get ctg agg ctg age gaa get ggt cat acc' gtc get tgc cat gat 96 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 
20 25 30 

gaa age ttt aaa cag aaa gat gaa ctg gaa get ttt get gaa acc tac 144 

Glu Ser Phe Lys Gin Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 
35 40 45 



192 



get gtc ace age get ttt ggt cag gtc gat gtc ctg gtc age aac gat 240 
Ala Val Thr Ser Ala Phe Gly Gin Val Asp Val Leu Val Ser Asn Asp 
*5 70 ■ 75 80 

ate ttt .get cca gaa ttt cag cca ate gat aaa tac get gtc gaa gat 288 
He Phe Ala Pro Glu Phe Gin Pro He Asp Lys Tyr Ala Val Glu Asp 
35 QQ 95 * 

tac agg ggt get gtc gaa get ctg cag ate agg cca ttt get cta.gtg 336 
Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 
100 105 no 

aat get gtg get teg caa atg aag aag cga aag teg ggg cae ate ate 3 84 

Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 
115 120 ' 125 

ttc ate act teg get gee ccg ttc ggg cca tgg aag gag eta teg act 432 
Phe He Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 
130 135 140 

tac act teg get cga get ggg get agt act eta get aat get eta teg 4 80 

Tyr Thr Ser Ala Arg Ala Gly Ala Ser Thr Leu Ala Asn Ala Leu Ser 
1« 150 155 iso 

aag gag eta gga gag tac aat ate ccg gtg ttc get ate ggg ccg aat 528 
Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 
165 170 175 



576 



116 
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180 185 190 

aag act aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 624 

Lys Thr Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 205 

caa cga eta ggg act caa aaa gag ttg ggg gaa ttg. gtg gca ttt ttg 672 

Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
210 215 220 

gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca 720 

Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 

225 230 235 " 240 

ggc ggc ttt ccc atg ata gaa cgt tgg ccc ggc atg ccc gaa taa 765 

Gly Gly Phe Pro Met lie Glu Arg Trp Pro Gly Met Pro Glu * 
245 " 250 



<210> 110 
<211> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HHDH S00772501 



<400> 110 



Met 


Ser 


Thr 


Ala 


He 


Val 


Thr 


Asn Val Lys His Phe Gly Gly Met Gly 


1 








5 






10 15 


Ser 


Ala 


Leu 


Arg 


Leu 


Ser 


Glu Ala Gly His Thr Val Ala Cys His Asp 








20 








25 30 


Glu 


Ser' 


Phe 


Lys 


Gin 


Lys 


Asp 


Glu Leu Glu Ala Phe Ala Glu Thr Tyr 






35 










40 45 


Pro -Gin 


Leu 


Lys 


Pro 


Met 


Ser 


Glu Gin Glu Pro Ala Glu Leu He Glu 




50 










55 


60 


Ala 


Val 


Thr 


Ser 


Ala 


Phe 


Gly Gin Val Asp Val Leu Val Ser Asn Asp 


65 










70 




75 80 


lie 


Phe 


Ala 


Pro 


Glu 


Phe 


Gin 


Pro He Asp Lys Tyr Ala Val Glu Asp 










85 






90 95 


Tyr Arg 


Gly 


Ala 


Val 


Glu 


Ala 


Leu Gin He Arg Pro Phe Ala Leu Val 








100 








105 no 


Asn 


Ala 


Val 


Ala 


Ser 


Gin 


Met 


Lys Lys Arg Lys Ser Gly His He He 






115 










120 125 


Phe 


He 


Thr 


Ser 


Ala 


Ala 


Pro 


Phe Gly Pro Trp Lys Glu Leu Ser Thr 




130 










135 


140 


Tyr 


Thr 


Ser 


Ala 


Arg 


Ala 


Gly Ala Ser Thr Leu Ala Asn Ala Leu Ser 


145 










150 




155 160 


Lys 


Glu 


Leu 


Gly 


Glu 


Tyr 


Asn 


lie 'Pro Val Phe Ala He Gly Pro Asn 










165 






170 * 175 


Tyr 


Leu 


His 


Ser 


Glu 


Asp 


Ser 


Pro -Tyr Phe Tyr Pro Thr Glu Pro Trp 








180 








185 190 


Lys 


Thr 


Asn 


Pro 


Glu 


His 


Val 


Ala His Val Lys Lys Val Thr Ala .Leu 






195 










200 205 


Gin Arg 


Leu 


Gly 


Thr 


Gin 


Lys 


Glu Leu Gly Glu Leu Val Ala Phe Leu 




210 










215 


220 


Ala 


Ser 


Gly 


Ser 


Cys 


Asp 


Tyr 


Leu Thr Gly Gin Val Phe Trp Leu Ala 



117 
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225 23 0 

Gly Gly Phe Pro Met lie Glu Arg Trp Pro Gly Met Pro Glu 



245 

<210> 111 
<211> 762 
<212> DNA 

<213>' Artificial Sequence 
<220> 

<223> HHDH S01035968 

<221> CDS 

<222> (1) . . . (765) 

<400> 111 



250 



atg age acc get ate gtc acc aac gtc aaa cat ttt gga ggt atq qqt 
Met Ser Thr Ala lie Val Thr Asn Val Lys His Phe fly Ify Tel lly 



10 15 



age get ctg agg ctg age gaa get ggt eat acc gtc get tgc cat gat 
Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His 1st 
20 25 



30 



c 9C f CaC aaa 9at 9aa ct9 9aa act ttt get gaa acc tac 

Glu Ser Phe Lys Kis Lys Asp Glu Leu Glu Ala Phe Ila flu ?£r T\n- 

33 40 • 45 

cca cag ctg aaa cca atg age gaa cag gaa cca get gaa ctg ate qaa 
Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ila Glu Leu 111 Glu 
50 55 60 

get gtc acc age get ttt ggt cag gtc cat gtc ctg qtc acc aac oat- 
Ala Val Thr t Ser Ala Phe Gly Gin Val Asp Val Leu v5 sir Asn III 
65 ' 70 75 ao 

t^ C tt Z Cta 93a tfct C " ccaatc Sat aaa tac get gtc gag gat 

He Phe Ala Leu Glu Phe Arg Pro lie Asp Lys Tyr Ila Val Glu Isp 
85 90 95 

tac agg ggt get. gtc gaa get ctg cag ate agg cca ttt get cta ata 
Tyr Arg Gly Ala Val Glu Ala Leu Gin lie Ar| Pro Phe 111 Leu Va! 
100 ios 11Q 

aat get gtg get teg caa atg aag aag cga aag teg ggq cac ate ate 
Asn Ala Val Ala Ser Gin Met Lys Lys Arg Lys Set lly His lie lit 
1X5 120 125 

ttc ate act teg get gee ccg ttc ggg cca tgg aag gag cta tea act 
Phe lie Thr Ser Ala Ala Pro 'Phe Gly Pro Trp Lys 111 Leu Set Tnr 
130 135 140 

tac tct teg get cga get ggg get agt gca cta get aat act cta tea 
Tyr ser Ser Ala Arg Ala Gly Ala Ser Ala Leu Ila Asn 111 Leu s4 
" 150 155 160 

aag gag cta gga gag. tac aat ate ccg gtg ttc gec ate ggg C cg aat 



48 



96' 



144 



192 



240 



238 



336 



334 



432 



430 



523 



118 
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Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala He Gly Pro Asn 

165 170 175 

tac eta cac teg gag gat teg ccg tac tat tac ccc aet gag ccg tgg 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Tyr Tyr Pro Thr Glu Pro Trp 

180 185 iso 

aag att aat ccg gag cac gtg get cac gtg aag aag gtg act get eta 

Lys He Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 

195 200 * 205 



576 



624 



caa cga eta ggg aet caa aaa gag ttg ggg gaa ttg gtg gca ttt ttg 672 
Gin Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val Ala Phe Leu 
210 215 220 

gca tct ggc tct tgt gat tat ttg act ggc cag gtg ttt tgg ttg gca 720 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gin Val Phe Trp Leu Ala 
225 230 235 * 240 

ggc ggc ttt ccc gtc ata gaa cgt tgg ccc ggc atg ccc gaa 762 
Gly Gly Phe Pro Val He Glu Arg Trp Pro Gly Met Pro Glu 
245 250 



<210> 112 
<211> 254 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HHDH S01035968 
<400> 112 

Met Ser Thr Ala He Val Thr Asn Val Lys His Phe Gly Gly Met Gly 

1 , 5 10 is 

Ser Ala Leu Arg Leu Ser Glu Ala Gly His Thr Val Ala Cys His Asp 

20 25 30 

Glu Ser Phe Lys His Lys Asp Glu Leu Glu Ala Phe Ala Glu Thr Tyr 

35 40 45 

Pro Gin Leu Lys Pro Met Ser Glu Gin Glu Pro Ala Glu Leu He Glu 

50 55 60 

Ala Val Thr Ser Ala Phe Gly Gin Val Asp Val Leu Val Ser Asn Asp 
65 70 75 80 

He Phe Ala Leu Glu Phe Arg Pro He Asp Lys Tyr Ala Val Glu Asp 

85 90 ^ * 95 

Tyr Arg Gly Ala Val Glu Ala Leu Gin He Arg Pro Phe Ala Leu Val 

100 105 ~ no 

Asn Ala. Val Ala Ser Gin Met Lys Lys Arg Lys Ser Gly His He He 

115 120 " 125 

Phe He Thr Ser Ala Ala Pro Phe Gly Pro Trp Lys Glu Leu Ser Thr 

130 135 140 

Tyr Ser Ser Ala Arg Ala Gly Ala Ser Ala Leu Ala Asn Ala Leu Ser 
145 150 155 160 

Lys Glu Leu Gly Glu Tyr Asn He Pro Val Phe Ala lie Gly Pro Asn 

165 170 * 175 

Tyr Leu His Ser Glu Asp Ser Pro Tyr Tyr Tyr Pro Thr Glu Pro Trp 
130 135 190 



119 
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L ys He Asn Pro Glu His Val Ala His Val Lys Lys Val Thr Ala Leu 
C-ln Arg Leu Gly Thr Gin Lys Glu Leu Gly Glu Leu Val' Ala Phe Leu 
Ala Ser Gly Ser Cys Asp Tyr Leu Thr Gly Gln Vax Phe Trp Leu Ala 
Gly Gly Phe Pro Val lie Glu Arg Trp Pro Gly Met Pro Glu 2 " 



<210> 113 
<211> 852 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> KRED S01040430 

<22l> CDS 

<222> (1) . . . (852) 

<400> 113 



atg get aaa aac ttt tec aat ate caa fat- « 

r «. , y= »_ ser Asn E S E E E £ H E S 



10 



15 



s? e | h e e s is s e e e e -J - H 



40 45 



2 E I" E E III S - - S E E E E - E 
E E E E E E E E S S E a E E E E 



80 



115 ' 120 

gat cag gat gac gat aaa cat ttt gac caq eta aft a*<- 

«p a. mp a=p SSP ^ ? is P„ a E oin S E E E E E 

140 



48 



96 



144 



192 



240 



ggc gta aaa gtc aag get tat aaa act aat- nhr f. 

Gly Val Lys Val Lys Ala Tyr Lys £ £ *° £ ^ £ g JJJ "8 

85 90 95 

gtg aaa cag act att gag caq caq ate aaa oat- en- „ 

v,l ^ 01, Thr ue «, 0^ oil E E E E My S E E " S 

105 110 
ata gtt gtg gcg aac gca ggc ate cca taq act- a = ~ m *. . 
He val val Ala Asn Ala Gly He Pro ri? S Lyl Gly i£ Tyr £ 



384 



432 



120 
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aaa ggc gta ggc tat gta gca aaa cat gcg ggt cgc cat ttt cgt gaa 480 
Lys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Phe Arg Glu 
145 150 155 ■ 160 

cgt ttc gaa aaa gaa ggc aaa aag ggc gcc ttg gtc ttt acg get tec 528 
Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 
16 5 17 0 • 175 

atg teg ggt cac ate gtt aac gtg ccg caa ttt cag gcg gcc tae aat 576 
Met Ser Gly His lie Val Asn Val Pro Gin Phe Gin Ala Ala Tyr Asn 
180 185 190 

gcg gtc aag gca ggc gtg cgt cat ttc gca aag tec ctg gcc gtg gaa 624 
Ala Val Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 
195 200 205 

ttt get cct ttc gca cgt gtt aac tct gta tct cct ggc tat att aat 672 
Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr lie Asn 
210 215 220 



acc gag ate tct gat ttc gtc ccg caa gaa aca cag aat aaa tgg tgg 

Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 
225 230 235 240 

age tta gtt cca ttg ggc cgt ggc ggg gaa act gcg gaa tta gtt ggt 

Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 
245 250 255 

gcc tac ctg ttc ctg gca agt gat gcg ggc tec tae gcc acg ggc acg 

Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 

260 265 270 



720 



768 



816 



gat ate att gtg gac ggc ggc tac acg ctg ccg tag 852 
Asp He He Val Asp Gly Gly Tyr Thr Leu Pro * 
275 280 



<210> 114 
<211> 233 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> KRED S01040430 



<400> 114 



Met Ala Lys 


Asn 


Phe 


Ser 


Asn Val Glu Tyr 


Pro 


Ala 


Pro 


Ala Pro Ala 


1 




5 




10 








15 


His Thr Lys 


Asn 


Glu 


Ser 


Leu Gin "Val Leu 


Asp 


Leu 


Phe 


Lys Leu Asn 


20 






25 








30 


Gly Lys Val 


Ala 


Ser 


He 


Thr Gly Ser Asn 


Ser 


Gly 


He 


Gly Tyr Ala 


35 








40 






45 




Leu Ala Glu 


Ala 


Phe 


Ala 


Gin Val Gly Ala 


Asp 


Val 


Ala 


He Trp .Tyr 


50 








55 




60 






Asn Ser His 


Asp 


Ala 


Thr 


Gly Lys Ala Glu 


Ala 


Leu 


Ala 


Lys Lys Tyr 


65 




70 




75 






80' 


Gly Val Lys 


Val 


Lys Ala 


Tyr Lys Ala Asn 


Val 


Ser Ser Ser Asp Ala 
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85 9 ° 95 

Val Lys Gin Thr lie Glu Gin Gin lie Lys Asp Phe Gly His Leu £so 

100 10 5 110 

He Val val Ala Asn Ala Gly lie Pro Trp Thr Lys Gly Ala Tyr lie 

115 120 1 -, 5 

Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Asp Leu 

130 135 140 P 

Lys Gly val Gly Tyr Val Ala Lys His Ala Gly Arg His. Phe Arg Glu 

^ ^ 155 i £n 

Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 



Met Ser Gly His lie Val Asn Val Pro Gin Phe Gin Ala Ala Tyf- Asn 

180 135 190 

Ala Val Lys Ala Gly Val Arg His. Phe Ala Lys Ser Leu Ala Val Glu 

195 200 205 

Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr lie Asn 

210 215 

Thr Glu lie Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 
225 230 235 ^ 

Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 

24 5 250 i 

Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly T hr 

260 265 * 7?0 

Asp Us He Val Asp Gly Gly Tyr Thr Leu Pro 
275 280" 

<210> 115 
<211> 852 
<2i2> DNA 

<213> Artificial Sequence 
<22Q> 

<223> KRED S01091361 

<221> CDS 

<222> (1) , . , (852) 

<400> 115 

atg get aaa aac ttt tec aat gtc gaa tat cct gec ccg gcg cca get 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Ala Pro Ala 
1 5 10 15 

HiJ Thr r aaa f C Hf* Ca9 9ta Ct9 gat Ct * ttc <=tg aac 

His Thi Lys Asp Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 

20 2 5 30 

ggc aaa gtc gcg tct ate acc ggt age aac tea gge att ggt tac gcg 
Gly Lys Val Ala Ser lie Thr Gly Ser Asn Ser Gly lie Gly Tyr Ala 
35 40 45 

ctg gec gaa get ttt gcg cag gtt ggc gca gac gtt gcg ate too tat 
Leu Ala Glu Ala Phe Ala Gin Val Gly Ala As P Val Ala lie Tro 
50 55 60 

aac agt cat gat gee acc ggt aaa gca gag gee ctg get aaa aaa tat- . 
Asn Ser His Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lvs Lys Tyr 
65 70 75 - l Q 



48 



96 



144 



192 



240 



122 
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ggc gta aaa gtc aag get tat aaa get aat gtc age teg agt gat gcg 
Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 
85 . 90 95 

gtg aaa cag act att gag cag cag ate aag gat ttt ggc cac ctg gac 
Val Lys Gin Thr lie Glu Gin Gin lie Lys Asp Phe Gly His -Leu Asp 
100 ' 105 " no 



288 



336 



ata gtt gtg gcg aac gca ggc ate cea tgg act aag ggt gca tae ate 3 84 

He Val Val Ala Asn Ala Gly He Pro Trp Thr Lys Gly Ala Tyr He 
115 120 125 

gat cag gat gac gat aaa cat ttt gac cag gtg gtg gac gtc gac ctg 43 2 

Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Asp Leu 
130 135 140 

aaa ggc .gca ggc tat gta gca • aaa cat gcg ggt cgc cat ttt cgt gaa 430 
Lys Gly Ala Gly Tyr Val Ala Lys His Ala Gly Arg His Phe Arg Glu ■ 
145 150 155 ~ ifiO 

cgt ttc gaa aaa gaa ggc aaa aag ggc gec ttg gtc ttt acg get tec 523 
Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 
165 170 175 

atg teg ggt cac att gtt aat gtg ccg caa ttt cag gcg acc tae aat 576 
Met Ser Gly His He Val Asn Val Pro Gin Phe Gin Ala Thr Tyr Asn 
180 185 190 

gcg gtc aag gca ggc ■ gtg cgt cat ttc gca aag tec ctg gec gtg gaa 624 
Ala Val Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 
195 200 205 

ttt get cct ttc gca cgt gtt aac tct gta tct cct ggc tat att aat 672 
Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr He Asn 
210 215 220 

acc gag ate tct gat ttc gtc ccg cag gaa aca caa aat aaa tgg tgg 720 
Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Lys Trp Trp 
225 230 235 * 240 

age tta gtt cca ttg ggc cgt ggc ggg gaa act gcg gaa tta gtt ggt 768 
Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 
245 250 255 

gee tae ctg ttc ctg gca agt gat gcg ggc tec tac gec acg ggc acg 816 
Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr Ala Thr Gly Thr 
260 265 ' * ' 270 

gat ate gtt gtg gac ggc ggc tac acg ctg ccg tag 852 
Asp He Val Val Asp Gly Gly Tyr Thr Leu Pro * 
275 280 



<210> 116- 
<211> 233 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> KRED S01091361 
<400> 116 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Ala Pro Ala 

1 5 10 15 

His Thr Lys Asp Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 

20 25 ~ 30 

Giy Lys Val Ala Ser lie Thr Gly Ser Asn Ser Gly He Gly Tyr Ala ■ 

35 40 45 

Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala lie Trp Tvr 

50 - 55 60 

Asn Ser His Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tvr 
65 70 75 so 

Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 

Val Lys Gin Thr lie Glu Gin Gin lie Lys Asp Phe Gly His Leu Asp 

100 105 * 110 

He Val Val Ala Asn Ala Gly lie Pro Trp Thr Lys Gly Ala Tyr lie 

• H5 120 2 95 

Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Asp Leu 

130 135 140 

Lys Gly Ala Gly Tyr Val Ala Lys His Ala Gly Arg His Phe Arq Glu 

145 150 155 ~ 160 - 

Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 

165 170 175 

•Met Ser Gly His He Val Asn Val Pro Gin Phe Gin Ala Thr Tyr Asn 

180 185 190 

Ala Val Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 

l- q 5 200 n Qq 

Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr He Asn 

210 215 99 0 

Thr 'Glu lie Ser Asp Phe Val Pro Gin. Glu Thr Gin Asn Lys Trp Trp 
* 2S 230 235 240 

Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 

245 250 955 

Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tvr Ala T hr Gly Thr 

260 265 " 270 

Asp He Val Val Asp Gly Gly Tyr Thr Leu Pro 
275 280 

<210> 117 
<211> 852 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> KRED S01091625 

<221> CDS 

<222> (1) . . . (3520 

<400> 117 

atg get aaa aac ttt tec aat gtc gaa tat cct ccc ccg gcg cca act 
Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Ala Pro Ala 



43 



124 
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1 5 10 15 

cat acc aag aac gaa tea ctg cag gta ctg gat ctg ttc aaa ctg aac 96 
' His Thr Lys Asn Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 
20 25 ~ 30 

ggc aaa gtc gcg tct ate acc ggt age aac tea ggc att ggt tac gcg 144 
Gly Lys Val Ala Ser He Thr Gly Ser Asn Ser Gly He Gly Tyr Ala 
35 40 45 

ctg gee gaa get ttt gcg cag gtt ggc gca gae gtt gcg ate tgg tat 192 
Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala He Trp Tyr 
50 55 60 

aac agt cat gat gee acc ggt aaa gca gag gec ctg get aaa aaa tat 240 
Asn Ser His Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala. Lys Lys Tyr 
65 70 75 80 

ggc gta aaa gtc -aag get tat aaa get aat gtc age teg agt gat gcg ' 288 
Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 
85 90 95 

gtg aaa cag act att gag cag cag ate aag gat ttt ggc cac ctg gac 336 
Val Lys Gin Thr He Glu Gin Gin He Lys Asp Phe Gly His Leu Astd . 

100 105 * no ' 

ata gtt gtg gcg aac gca ggc ate cca tgg act aag ggt gca tac ate 3 84 

He Val Val Ala Asn Ala Gly He Pre Trp Thr Lys Gly Ala Tyr He 
115 120 " ^ 125 

gat cag gat gac gat aaa cat ttt gac cag gtg gtg gac gtc gac ctg 432 
Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Asp Leu 
130 " 135 140 

aaa ggc gca ggc tat gta gca aaa cat gcg ggt cgc cat ttt cgt gaa 480 
Lys Gly Ala Gly Tyr Val Ala Lys His Ala Gly Arg His Phe Arg Glu 
145 150 155 " 160 

cgt ttc gag aaa gaa ggc aaa aag ggc gee ttg gtc ttt acg get tec 528 
Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 
165 170 175 

atg teg ggt cac ate gtt aat gtg ccg caa ttt cag gcg acc tac aat 576 
Met Ser Gly His He Val Asn Val Pro Gin Phe Gin Ala Thr Tyr Asn 
180 185 190 

gcg gtc aag gca ggc gtg cgt cat ttc gca aag tec ctg gee gtg gaa 624 
Ala Val Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 
195 200 205 

ttt get cct ttc gca cgt gtt aac tct gta tct' cct ggc tat att aat 672 
Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr He Asn 
210 215 220 

acc gag ate tct gat ttc gtc ccg caa gaa aca caa aat aga tgg tgg 720 
Thr Glu He Ser Asp Phe Val Pro Gin Glu Thr Gin Asn Arg Trp Trp 
225 230 235 " J 240 



125 
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age tta .gtt cca ttg gqc cat qqc caa a?>=» ™~ 
S*r L. vax P t c J 1?, £ l? y £ £ H ffi S S£ 

Oca ■* 



2so 255 



gec tac ctg ttc ctg gca agt gat gcg c qc tep „„ 

x yr ta a. ^ Ala ssr isp <£ gj £ «. s - ? «. « 

2 55 — _ _ 



Gly Lys Val Ala Ser lie Thr Gly Ser Asn Ser Gly H e Gly Tyr Ala 
Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala He Trp Tyr 
Asn Ser His Asp Ala Thr G^y Lys Ala Glu Ala' lL Ala Lys Lys Tyr 
Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ma 
Val Lys Gin Thr lie Glu Gin Gin lie Lys Asp P he Gly His Leu .Asp 
He Val val Ala Asn Ala Gly lie Pro Trp Thr Lys Gly Ma Tyr lie 
Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Asp Leu 
Lys Gly Ala Gly Tyr Val Ala Lys His Ala Gly Arg His Phe Arg Glu 
Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Jeu Val Phe Thr Ala III 



Met ser Gly His He Val Asn Val Pro Gin Phe Gin Ala Thr Tyr Asn 
xau 185 



Ala val Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Val Glu 
Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro Gly Tyr lie Asn 
Thr Glu He Ser As P Phe Val Pro Gin Glu Thr Gl'n Asn Arg Trp Trp 
Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr Ala Glu Leu Val Gly 
Ala Tyr Leu Phe Leu Ala Ser Asp Ala GlJ Ser Tyr Ala Thr IVy Thr 



7b8 



816 



yau ate att gtg gac ggc ggc tac acg ctg ccq taq 

Asp lie lie Val Asp Gly Gly Tyr Thr Leu Pro * 352 
275 280 

<210> 118 
<211> 283 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> KHHD S01091625 
<400> 118 

Met Ala Lys Asn Phe Ser Asn Val Glu Tyr Pro Ala Pro Ala Pro. Ala 

His Thr Lys Asn Glu Ser Leu Gin Val IL Asp Leu Phe Lys Leu Asn ' 

2 5 



270 



126 
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Asp He lie Val Asp Gly Gly Tyr Thr Leu Pro 
275 280 

<210> 119 
<211> 852 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> BCRED S01094b48 

<221> CDS 

<222> (1) . . . (852) 

<400> 119 

atg get aaa aac ttt tec aat gtc gga tat cct gec ccg gcg cca get 48 

Met Ala Lys Asn Phe Ser Asn Val Gly Tyr Pro Ala Pro Ala Pro Ala 

15 10 is 



cat ace aaa age gaa tea ctg cag gta ctg gat ctg ttc aaa ctg aac 
His Thr Lys Ser Glu Ser Leu Gin Val Leu Asp Leu Phe Lys Leu Asn 
20 25 30 



aac age cat gat gee acc ggt aaa gca gag gee ctg get aaa aaa tat 
Asn Ser 'His Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys Tyr 
65 70 75 * 80 



96' 



ggc aaa gtc gcg tct ate acc ggt age aac tea ggc att ggt tac gcg 144 
Gly Lys Val Ala Ser He Thr Gly Ser Asn Ser- Gly He Gly Tyr Ala 
35 40 45 

ctg gee gaa get ttt gcg cag gtt ggc gca gac gtt gcg ate tgg tat 192 
Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala He Trp Tyr 
50 55 • 60 



240 



ggc gta aaa gtc aag get tat aaa get aat gtc age teg agt gat gcg 288 
Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 
85 90 95 

gtg aaa cag act att gag cag cag ate aag gat ttt ggc cac ctg gac 336 
Val Lys Gin Thr He Glu Gin Gin lie Lys Asp Phe Gly His Leu Asp 
100 105 " no 

ata gtt gtg gcg aac gca ggc ate cca tgg act aag ggt gca tac ate 3 84 

He Val Val Ala Asn Ala Gly He Pro Trp Thr Lys Gly Ala Tyr He 
115 120 125 

gat cag gat gac gat aaa cat ttt gac cag gtg gtt gac gtc gac ctg 432 
Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val Asp Val Asp Leu 
130 135 140 

aaa ggc gta ggc tat gta gca aaa cat gcg ggt cgc cat ttt cgt gaa 4 80 

Lys Gly Val Gly Tyr Val Ala Lys His Ala Gly Arg His Phe Arg Glu 
14 5 150 155 ~ 160 

cgt ttc gaa aaa gaa ggc aaa aag ggc gee ttg gtc ttt acg get tec 528 
Arg Phe Glu Lys Glu Gly Lys Lys Gly Ala Leu Val Phe Thr Ala Ser 
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165 17Q 



175 



atg teg ggt cac ate gtt aac gtg ccg caa ttt cag gcg gec tac aat 
Met Ser C-ly His He Val Asn Val Pro Gin Phe Gin La Ma j£ 
130 135 190 

gcg gtc aag gca ggc gtg cgt cat ttc gca aag tec ctg gec gtg gaa 
Ala Val Lys Ala Gly Val Arg His Phe Ala Lys Ser Leu Ala Vat Gin 
195 200 205 

ttt get cct ttc gca cgt gtt aac tct gta tct cct qqc tat att aat 
Phe Ala Pro Phe Ala Arg Val Asn Ser Val Ser Pro ITy J£ Tl e Asn 
10 215 220 



^ ^ C " Ct ? at " C 9 tc cc 9 99a aca cag aat aaa tgg tgg 

Thr Glu lie Ser Asp Phe Val Pro Gin Gly Thr Gin Asn Lys Trp T^p 

230 035 240 



age tta gtt cca ttg ggc cgt ggc ggg gaa act gcg gaa tta gtt act 
Ser Leu Val Pro Leu Gly Arg Gly Gly Glu Thr 111 Glu Leu ?2 It 
245 • . 250 

gec tac ctg ttc ctg gca agt gat gcg ggc tec tac gec acg gge a^q 
Ala Tyr Leu Phe Leu Ala Ser Asp Ala Gly Ser Tyr La Th? ITy 
2S0 • 265 - 270 

gat ate att gtg gac ggc ggc tac acg ctg ccg tag 
Asp He He Val Asp Gly Gly Tyr Thr Leu Pr- Q * 
275 280 



<210> 120 
<211> 283 
<212> PRT 

<213> Artificial Sequence 
<22G> 

<223> KRED S01094648 



<400> 120 

Met Ala Lys Asn Phe Ser Asn Val Gly Tyr Pro Ala Pro Ala Pro Ala 
5 10 



His Thr Lys Ser Glu Ser Leu Gin Val £eu Asp Leu Phe Lys tlu Asn 

2 0 t 5 

Gly Lys Val Ala Ser lie Thr Gly ier Asn Ser Gly H e Gly Tyr Ala 

Leu Ala Glu Ala Phe Ala Gin Val Gly Ala Asp Val Ala He Trp Tyr 

Asn Ser His Asp Ala Thr Gly Lys Ala Glu Ala Leu Ala Lys Lys 

b ° 70 75 80~ 

Gly Val Lys Val Lys Ala Tyr Lys Ala Asn Val Ser Ser Ser Asp Ala 

Val Lys Gin Thr He Glu Gin Gin He Lys Asp Phe Gly His Leu As P 
100 105 n 0 * 

He Val Val Ala Asn Ala Gly lie Pro Trp Thr Lys Gly Ala Tyr II- 

115 120 n 5 

Asp Gin Asp Asp Asp Lys His Phe Asp Gin Val Val £sp Val Asp Leu 



576 



624 



. 672 



720 



768 



816 



852 
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130 






135 




140 




Lys 


Gly 


Val 


Gly Tyr 


Val Ala 


Lys 


His Ala Gly Arg His 


Phe Arg Glu 


145 








150 




155 


160 


Arg 


Phe 


Glu 


Lys Glu 


Gly Lys 


Lys 


Gly Ala Leu Val Phe 


Thr Ala Ser 








16 5 






170 


175 


Met 


Ser Gly His He 


Val Asn 


Val 


Pro Gin Phe Gin Ala 


Ala Tyr Asn 








180 






185 


190 


Ala 


Val Lys Ala Gly 


Val Arg 


His 


Phe Ala Lys Ser Leu 


Ala Val Glu 






195 






200 


205 




Phe 


Ala 


Pro 


Phe Ala 


Arg Val 


Asn 


Ser Val Ser Pro Gly Tyr He Asn 




210 






215 




220 




Thr 


Glu 


He 


Ser Asp 


Phe Val 


Pro 


Gin Gly Thr Gin Asn* 


1 Lys Trp Trp 


225 








230 




235 


240 


Ser 


Leu 


Val 


Pro Leu 


Gly Arg Gly 


Gly Glu Thr Ala Glu 


Leu Val Gly 








245 






250 


255 


Ala 


Tyr 


Leu 


Phe Leu 


Ala Ser 


Asp 


Ala Gly Ser Tyr Ala 


Thr Gly Thr 








260 






265 


270 .V 


Asp 


He 


He 


Val Asp 


Gly Gly- Tyr 


Thr Leu Pro 








275 






280 







<210> 121 
<211> 789 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> GDH S01024744 

<221> CDS 

<222> (1) . . . (789) 

<400> 121 

atg tat ccg gat tta aaa gga aaa gtc gtc get att aca gga get get 43 

Met Tyr Pro Asp Leu Lys Gly Lys Val Val Ala He Thr Gly Ala Ala 
1 5 10 15 

tea ggg etc gga aag gcg atg gee att cgc ttc ggc aag gag cag gca 96 
Ser Gly Leu Gly Lys Ala Met Ala He Arg Phe" Gly Lys Glu Gin Ala 
20 25 30 

aaa gtg gtt ate aac tat tat agt aat aaa caa gat ccg aac gag gta 144 
Lys Val Val lie Asn Tyr Tyr Ser Asn Lys Gin Asp Pro Asn Glu Val 
35 40 45 

aaa gaa gag gtc ate aag gcg ggc ggt gaa get gtt gtc gtc caa gga 192 
Lys Glu Glu Val He Lys Ala Gly Gly Glu Ala Val Val Val Gin Gly 
50 55 60 

gat gtc acg aaa gag gaa gat gta aaa aat ate gtg caa acg gca att 240 
Asp Val Thr Lys Glu Glu Asp Val Lys Asn He Val Gin Thr Ala He 
65 70 75 80 

aag gag ttc ggc aca etc gat att atg att aat aat gee ggt ctt ,gaa 288 
Lys Glu Phe Gly Thr Leu Asp He Met He Asn Asn Ala Gly Leu Glu 
85 90 .95 

aat cct gtg cca tct cac gaa atg ccg etc aag gat tgg gat aaa gtc . 3 36 
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Asn Pro Val Pro Ser His Glu Met Pro Leu Lys Asp Trp Asd Lvs VaT 
100 105 no 

ate ggc acg aac tta acg ggt gec ttt tta gga age cgt gaa gcq att isa 
He Gly Thr Asn Leu Thr Gly Ala Phe Leu Gly Ser Arg Glu Ala 11 = 
1*5 120 125 

aaa tat ttc gta gaa aac gat ate aag gga aat gtc att aac atg tec 
Lys Tyr Phe Val Glu Asn Asp He Lys Gly Asn Val lie Asn Mec Ser 
130 135 140 

agt gtg cac gaa gtg att cct tgg cca tta ttt gtc cac tat gcg qca 4 ao 
Ser Val His Glu Val lie Pro Trp Pro Leu Phe Val His Tyr S 111 
145 150 155 " iso 

agt aaa ggc ggg atg aag ctg atg aca gaa aca tta gcg ttg gaa tac s?a 
Ser Lys Gly Gly Met Lys Leu Met Thr Glu Thr Leu Ala Leu Glu iyr 
165 170 175 

gcg ccg aag ggc att cgc gtc aat aat att ggg cca ggt acq ate aac «?« 
Ala Pro Lys Gly He Arg Val Asn Asn lie G?y Pro Gly Ma He Asr 
180 185 19Q 

acg acg ate aat aag gag aaa ttt get gac cct gaa caq aga get gat 6?4 
Thr Thr lie Asn Lys Glu Lys Phe Ala Asp Pro Glu Gin Arg Ala Asp 
195 200 205 

gta gaa age atg att cca atg gga tat ate ggc gaa ccg gag gaa ate 572 
•Val Glu Ser Met He Pro Met Gly Tyr He Gly Glu Pro Glu Glu II- 
210 215 220 " 

gee gca gta gca gec tgg ctt get teg aag gaa gec age tac gtc aca 
Ala Ala -Val Ala Ala Trp Leu Ala Ser Lys Glu Ala Ser Tyr Val Thr- 
225 230 235 240 

ggc ate acg tta ttc gcg gac ggc ggt atg aca caa tat cct tea ttc 
Gly He Thr Leu Phe Ala Asp Gly Gly Met Thr Gin Tyr Pro Ser Phe 
245 250 " ,55 

cag gca ggc cgc ggt taa tga 
Gin Ala Gly Arg Gly * + 
2S0 



<210> 122 
<211> 261 
<212> PRT 

<213> Artificial Sequence 

. <220> 
<223> GDH S01024744 

<400> 122 

Met Tyr Pro Asp Leu Lys Gly Lys Val Val Ala He Thr Gly Ala Ala 

1 5 10 i 5 

Ser Gly Leu Gly Lys Ala Met Ala He Arg Phe Gly Lys Glu Gin AT a 
20 25 30 



720 



753 



789 
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Lys Val Val lie Asn Tyr Tyr Ser Asn Lys Gin Asp Pro Asn Glu Val 

35 40 45 

Lys Glu Glu Val He Lys Ala Gly Gly Glu Ala Val Val Val Gin Gly 

50 55 60 

Asp Val Thr Lys Glu Glu Asp Val Lys Asn He Val Gin Thr Ala He 
65 70 75 80 

Lys Glu Phe Gly Thr Leu Asp He Met He Asn Asn Ala Gly Leu Glu 

85 90 95 

Asn Pro Val Pro Ser Kis Glu Met Pro Leu Lys Asp Trp Asp Lys Val 

100 105 " lio 

He Gly Thr Asn Leu Thr Gly Ala Phe Leu Gly Ser Arg Glu Ala lie 

115 120 " 125 

Lys Tyr Phe Val Glu Asn Asp He Lys Gly Asn Val He Asn Met Ser 

130 135 140 

Ser Val Kis Glu Val He Pro Trp Pro Leu Phe Val His Tyr Ala Ala 
145 150 155 160 

Ser Lys Gly Gly Met Lys Leu Met Thr Glu Thr Leu. Ala Leu Glu Tyr 

165 170 175 

Ala Pro Lys Gly He Arg Val Asn Asn lie Gly Pro Gly Ala He Asn 

180 185 " 190 

Thr Thr lie Asn Lys Glu Lys Phe Ala Asp Pro Glu Gin Arg Ala Asp 

195 200 " 205 

Val Glu Ser Met lie Pro Met Gly Tyr lie Gly Glu Pro Glu Glu lie 

210 215 ' 220 

Ala Ala Val Ala Ala Trp Leu Ala Ser Lys Glu Ala Ser Tyr Val Thr' 
225 230 235 240 

Gly lie Thr Leu Phe Ala Asp Gly Gly Met Thr Gin Tyr Pro Ser Phe 

245 250 " 255 

Gin Ala Gly Arg Gly 
260 

<210> 123 
<211> 739 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> GDH S01052992 

<221> CDS 

<222> (1) . . . (789) 

<400> 123 

atg tat ccg gat tta aaa gga aaa gtc gtc get att aca gga get get 43 
Met Tyr Pro Asp Leu Lys Gly Lys Val Val Ala lie Thr Gly Ala Ala 
15 10 15 

tea ggg etc gga aag gcg atg gee att cgc ttc ggc aag gag cag gca 96 
Ser Gly Leu Gly Lys Ala Met Ala lie Arg Phe Gly Lys Glu Gin Ala 
20 25 " ~ 30 

aaa gtg gtt ate aac tat tat agt aat aaa eaa gat ccg aac gag gta 144 
Lys Val Val He Asn Tyr Tyr Ser Asn Lys Gin Asp Pro Asn Glu Val 
35 40 45 

aaa gaa gag gtc ate aag gcg ggc ggt gaa get gtt gtc gtc caa gga 192 
Lys Glu Glu Val lie Lys Ala Gly Gly Glu Ala Val Val Val Gin Gly 
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t 



V 



50 55 

i| e .s s a e 5 e 5 s s: a s 2 e s 

75 80 

aag gag ttc ggc aca etc qat a. u t atcr a.-*. 

Lys Glu Phe Gly Thr Leu Asp lie Mac J£ C " 933 

85 " r: Asn Asn Ala Gly Leu Glu 

95 

aat cct gtg cca tct. cac gaa atq cca et-<- = 

Asn Pro Val Pro Ser His Glu Me? Pro LeS J£ J" ^ 9tC 

100 log ieU L ^ s As P Trp Asp Lys Val 

110 

ate ggc acg aac tta acg get qcc ttt n- a „ 

nt aiy s *- i - ~ «■ «- s s s e s s: 

125 

aaa tat ttc gta gaa aac gat ate aag qQa aat- ^ ^ 
Lys Tyr Phe Val Glu Asn Asp He Lys ITy Ts n £? " C tCC 

130 13(5 * 7 Asn Val Us Asn Met Ser 

• 140 

agt gtg cac gaa gtg att cct tgg cca tta t-ht- 

Sej «a Hi, «. wl xu Pro £ Pco £ £ *J ~ £ „ „ 

160 

agt aaa ggt ggg atg aag ctq atq aca a 

Ser Lys Gly Gly Met Lyl Leu Met J£ ^hr I 9&a taC 

165 7>„ Tnr Leu Ala Glu Tyr 

175 

9cg ccg aag ggc att cgc gtc aat aat at-t- M 

*"*■ 2? IU " 9 - - " s * H S 1^ 

185 190 
acg acg ate aat aag gaq aaa ttt- n n- „ 

205 

s s* s s? s: s s s; s s s r r» s,t « •* 

210 mc 7 S Glu Pr ° Asp Glu He 

220 

K S ?" K S S S S S 51 S* 5° t9c t,c 9 - - 

225 23 ^ eU Ala Ser L ^ s Glu Ala Cys Tyr Val Thr 

235 240 

ggc ate acg tta ttc gcq aac aac aor 

«y .u », u. ». Ill Lp i? c !?= 2f s £ £ £ £ S 



250 355 



<210> 124 
<211> 251 



240 



288 



33b ' 



384 



432 



430 



528 



576 



624 



67: 



720 



763 



gca ggc cgc ggt taa tga 
Gin Ala Gly Arg Gly * * 789 
260 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> GDH S01052992 
<400> 124 

Met Tyr Pro Asp Leu Lys Gly Lys Val Val Ala He Thr Gly -Ala Ala 

1 5 10 15 

Ser Gly Leu Gly Lys Ala Met Ala He Arg Phe Gly Lys Glu Gin Ala 

20 25 -30 

Lys Val -Val He Asn Tyr Tyr Ser Asn Lys Gin Asp Pro Asn Glu Val 

35 40 45 

Lys Glu Glu Val He Lys Ala Gly Gly Glu Ala Val Val Val Gin Gly 

50 55 60 

Asp Val Thr Lys Glu Glu Asp Val. Lys Asn He Val Gin Thr Ala lie 
GS~ " 70 75 80 

Lys Glu Phe Gly Thr Leu Asp -He -Met He Asn Asn Ala Gly Leu Glu 

85 90 95 

Asn Pro Val Pro Ser His Glu Met Pro Leu Lys Asp Trp Asp Lys Val 

100 105 HO 

He Gly Thr Asn Leu. Thr Gly Ala Phe Leu Gly Ser Arg Glu Ala Ha 

115 120 125 ' 

Lys Tyr Phe Val Glu Asn Asp He Lys Gly Asn Val He Asn Met Ser 

130 135 140 

Ser Val His Glu Val He Pro Trp Pro Leu Phe Val His Tyr Ala Ala 
145 150 155 160 

Ser Lys Gly Gly Met Lys Leu Met Thr Lys Thr Leu Ala Leu Glu Tyr 

165 170 175 

Ala Pro Lys Gly He Arg Val Asn Asn lie Gly Pro Gly Ala He Asn 

180 135 190 

Thr Thr He Asn Lys Glu Lys Phe Ala Asp Pro Glu Gin Arg Ala Asp 

195 200 205 

Val Glu Ser Met He Pro Met Gly Tyr lie Gly Glu Pro Asp Glu lie 

•210 215 220 

Ala Ala Val Ala Ala Trp Leu Ala Ser Lys Glu Ala Cys Tyr Val Thr 
225 230 235 240 

Gly He Thr Leu Phe Ala Asp Gly Gly Met Thr* Gin Tyr Pro Ser Phe 

245 250 255 

Gin Ala Gly Arg Gly 
260 

<210> 125 

<211> 789 

<212> DNA ! 

<213> Artificial Sequence 

<220> 

<223> GDH S0.1063714 

<221> CDS 

<222> (1) . . . (789) 

<400> 125 

atg tat ccg gat tta aaa gga aaa gtc gtc get att aca gga get act 

Met Tyr Pro Asd Leu Lys Gly Lys Val Val Ala He Thr Gly Ala Thr 
1*5 10 15 
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235 240 



144 



1S2 



288 



336 



384 



tea ggg etc gga aag gcg atq qcc att can 

s« ar ^ W I li Me ? l. ne £ £ if= £ « a e 

" L 25 30 

aaa gtg gtt ate aac tat tat agt aat aaa r=>* ™t- 
I*. V,! Val ue As„ Ty. Ty E & As. £ ~ £ ^ IS 5! S 

3= 40 4S 

aaa gaa gag gtc ate aag acq qqc aat as> 

«« G iu v,! Ile ty ? H S y 15 k £ »» «s SS S r y 

5 5 go 

S £ X S! £ S S3 K E S £f £ £ S S 2 " 

75 30 
aag gag ttc ggc aca etc gat att atq att a a t- »«- 

,y, 01. PHe sly T£ ^ Lp Ue „.? S 2. £ K I! 25 ffi 

90 . 95 

aat cct gtg cca tct cac gaa atg ccq etc aaa _ *. 

AS. Pro Val P« S .r Hi. SI. „.? Pr = L «= £ « * £ »» *. 

100 1Q 5 no 

ate ggc acg aac tta aca aat ace tt-h i-t- = 

ne oly As. to ^ l?y E £ S 5 S S I" S III 
115 120 125 

aaa tat ttc gta gaa aac gat ate aaq qaa aat ert-n . 

Tyr Pha Val 0!u As. Asp n. L yf If* £ & J£ "° £ £ ' " 

135 140 
a 9t gtg cac gaa gtg att cct tqq cca hha t-t-f- 

mj Val Bls om v,, n, p,o £ S S ££ & S £ S S 

155 .160 
agt aaa ggc ggg atg aag ctg atg aca qaa aca tf* 

Ser Ly S Oly oiy Met Lys Leu Me? Thr llu Thr Leu S Leu ?" £ 
165 170 175 

gcg ccg aag ggc att cgc gtc aat aat att gqg cca ggt acq ate aar 
.Ala Pro Lys Oly Ue Arg Val Asn Asn lie g* Pro gj g S ."n 
180 135 190 

s s s: As. s S S £ K 5 e SI a 5 s 5 

.20^ 

gta gaa age atg att cca atg gga tat ate cac gaa ce~ -aa a-a atr 
vai c-lu Ser Met lie P ro Met Oly Tyr lie oly fju ?/ Q |g ^ 

gee gca gta gca gee tgg ctt get teg aag gaa gee aac tar at-n »-» 
A a Ma val Ala Ala Trp Leu Ala Ser Ly^ J lu ^ S ^ «JJ ^ 

J 230 tt; * 



480 



528 



57b 



624 



672 
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ggc ate acg tta ttc gcg gac ggc ggt atg aca caa tat cct tea tte 768 
Gly He Thr Leu Phe Ala Asp Gly Gly Met Thr Gin Tyr Pro Ser Phe 
245 250 255 

cag gca ggc cgc ggt taa t'ga 739 
Gin Ala Gly Arg Gly * * 

'260 



<210> 126 
<211> 261 
<212> PRT 

<213> Artificial Sequence 
<220> 

< 22 3 > GDH S01063714 
<400> 126 

Met Tyr Pro Asp Leu Lys Gly Lys Val Val Ala He Thr Gly Ala Thr 

1 5 io 15 

Ser Gly Leu Gly Lys Ala Met Ala He Arg Phe Gly Lys Glu Gin Ala 

20 25 30 

Lys Val Val He Asn Tyr Tyr Ser Asn Lys Gin Asp Pro" Asn Glu Val 

35 40 45 

Lys Glu Glu Val He Lys Ala Gly Gly Glu Ala Val Val Val Gin Gly 

50 55 . 60 

Asp Val Thr Lys Glu Glu Asp Val Lys Asn He Val Gin Thr Ala lie 
65 70 75 80 

Lys Glu Phe Gly Thr Leu Asp He Met He Asn Asn Ala Gly Leu Glu 

85 90 95 

Asn Pro Val Pro Ser His Glu Met Pro Leu Lys Asp Trp Asp Lys Val 

100 105 ~ ~ no 

He Gly 'Thr Asn Leu Thr Gly Ala Phe Leu Gly Ser Arg Glu Ala lie 

115 120 ~ 125 

Lys Tyr Phe Val Glu Asn Asp lie Lys Gly Asn Val He Asn Met Ser 

130 135 140 

Ser Val His Glu Val He Pro Trp Pro' Leu Phe Val His Tyr Ala Ala 
145 150 155 igo 

Ser Lys Gly Gly Met Lys Leu Met Thr Glu Thr Leu Ala Leu Glu Tyr 

165 170 175 

Ala Pro Lys Gly He Arg Val Asn Asn lie Gly Pro Gly Ala lie Asn 

180 135 190 

Thr Thr lie Asn Lys Glu Lys Phe Ala Asp Pro Glu Gin Arg Ala Asp 

195 200 205 

Val Glu Ser Met lie Pro Met Gly Tyr He Gly Glu Pro Glu Glu He 

210 215 220 

Ala Ala Val Ala Ala Trp Leu Ala Ser Lys Glu Ala Ser Tyr Val Thr 
225 230 " 235 240 

Gly He Thr Leu Phe Ala Asp Gly Gly Met Thr Gin Tyr Pro Ser Phe 

245 250 255 

Gin Ala Gly Arg Gly 
260 
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